News — Luo Mai

July 5, 2026

Two papers accepted to SOSP 2026

Wavel and MeshRT were accepted to the ACM Symposium on Operating Systems Principles.

June 27, 2026

Ryze published at ACL 2026 System Demonstrations

Ryze synthesizes evidence-enriched biomedical VLM training data from papers.

March 20, 2026

BatchGen accepted to OSDI 2026

BatchGen targets throughput-first inference for very large MoE-style models.

February 20, 2026

ContextPilot accepted to MLSys 2026

ContextPilot speeds long-context inference through context reuse.

November 25, 2025

BitDecoding accepted to HPCA 2026

BitDecoding accelerates low-bit KV-cache inference by unlocking Tensor Cores.

September 25, 2025

MoE-CAP accepted to NeurIPS 2025

MoE-CAP appears in the Dataset and Benchmark Track for mixture-of-experts evaluation.

August 25, 2025

Award for AI4Math systems research

New funding to build systems that support AI-driven mathematical discovery.

July 1, 2025

WaferLLM accepted to OSDI 2025

WaferLLM is a wafer-scale LLM inference system designed beyond GPU assumptions.

March 25, 2025

Promoted to Reader at Edinburgh

Promotion to Reader (Associate Professor) starts in August 2025.

October 26, 2024

ARIA award for scaling AI compute

Imperial, Edinburgh, and Cambridge team up on modular AI systems simulation.

October 25, 2024

Tenplex accepted to SOSP 2024

Tenplex enables elastic LLM training with multi-dimensional parallelism.

August 10, 2024

Yao Fu wins Rising Star in ML and Systems

Yao was selected from a global cohort for systems-focused ML research.

July 10, 2024

ServerlessLLM accepted to OSDI 2024

A checkpoint-aware serverless inference system for low-latency LLM deployment.

May 10, 2024

Microsoft Research StarTrack Scholar Award

Received MSR recognition for early-career computer systems research.

January 20, 2024

ML Systems CDT funded by EPSRC and industry

Edinburgh launched a doctoral training centre spanning the AI systems stack.

December 10, 2023

TorchOpt accepted to JMLR

TorchOpt also became a PyTorch Ecosystem Project after two years of incubation.

November 10, 2023

Chancellor's Rising Star finalist at Edinburgh

Recognized through school and college-level selection for rising research leadership.

September 10, 2023

GEAR accepted to ICML 2023

GEAR bridges reinforcement learning and LLM training workflows.

December 18, 2022

Ekko accepted to OSDI 2022

Ekko unifies training and inference for low-latency recommender model updates.

July 10, 2022

MegBA accepted to ECCV 2022

MegBA is a GPU-based distributed library for large-scale bundle adjustment.

July 5, 2021

Cameo accepted to NSDI 2021

Cameo enables fine-grained real-time stream processing with deadline awareness.

August 18, 2020

KungFu accepted to OSDI 2020

KungFu makes distributed ML training adaptive and performance-portable.

November 5, 2019

Invited SOSP AI Systems workshop talk

Talked on adaptive distributed training for deep learning models.