We focus on making AI smaller, faster, and more efficient through full-stack innovations:
- 🧠 Algorithm: Designing efficient model architectures and approximations (e.g., sparsity, compression).
- ⚙️ System: Building hardware-aware system support to accelerate emerging AI workloads.
- 🚀 Application: Working with real-world use cases in generative AI, robotics, and scientific discovery.
We are affiliated with the UCSD ML Systems Group.
News
- SparseLoRA is accepted to ICML 2025!