Page not found
Perhaps you were looking for one of these?
Latest
- WaferLLM: Large Language Model Inference at Wafer Scale
- [03/25, Achievement] Starting August 2025, I’ll be Reader (Associate Professor).
- [03/25, Paper] WaferLLM, the world fatest LLM inference system, has been accepted to OSDI 2025.
- [10/24, Grant] Secured a prestigious ARIA grant with Imperial College & Cambridge University.
- [10/24, Paper] Tenplex, the first elastic LLM system, accepted to SOSP 2024.
- ServerlessLLM
- [07/24, Student Achievement] Congrats to Yao Fu on Winning 2024 Rising Star in ML & Systems.
- Tenplex: Dynamic Parallelism for Deep Learning using Parallelizable Tensor Collections
- [07/24, Paper] ServerlessLLM, the first serverless LLM system, accepted to OSDI 2024.
- Learning high-frequency functions made easy with sinusoidal positional encoding