Luo Mai

Luo Mai

Associate Professor

University of Edinburgh

About Me

I am a Reader (Associate Professor) in the School of Informatics at the University of Edinburgh, where I lead the Large-Scale Machine Learning Systems Group. I also co-lead the UK EPSRC Centre for Doctoral Training in Machine Learning Systems and a UK ARIA Project on Scaling AI Compute.

My research group works across the full stack of AI systems – from models and data to software and infrastructure – pursuing co-designs that aim to achieve a 1000× leap in efficiency, scalability, and reliability. My work has been recognized with several awards and rising-star honors from both academia and industry. I am passionate about open-source and knowledge exchange. I co-founded open-source AI system libraries that have collectively received over 20,000 GitHub stars and co-edited the open-source textbook Machine Learning Systems: Design and Implementation.

Before joining Edinburgh, I was a Research Associate at Imperial College London, working with Peter Pietzuch, and a Visiting Researcher at Microsoft Research. My PhD, supervised by Paolo Costa and Alexander L. Wolf, was supported by a Google Fellowship in Cloud Computing.

Interests

  • Computer Systems
  • Machine Learning
  • Data Management

Education

  • PhD in Computer Science, 2018

    Imperial College London, UK

  • MRes in Advanced Computing, 2012

    Imperial College London, UK

Publications

(2025). MoE-CAP: Benchmarking Cost, Accuracy and Performance of Sparse Mixture-of-Experts Systems. In NeurIPS.

PDF

(2025). WaferLLM: Large Language Model Inference at Wafer Scale. In OSDI.

PDF Project

(2024). Tenplex: Dynamic Parallelism for Deep Learning using Parallelizable Tensor Collections. In SOSP.

PDF

(2024). Learning high-frequency functions made easy with sinusoidal positional encoding. In ICML.

PDF Code

(2024). ServerlessLLM: Low-Latency Serverless Inference for Large Language Models. In OSDI.

PDF Code Project

(2023). TorchOpt: An Efficient Library for Differentiable Optimization. In JMLR.

PDF Code

(2023). GEAR: A GPU-Centric Experience Replay System for Large Reinforcement Learning Models. In ICML.

PDF Code

(2022). Ekko: A Large-Scale Deep Learning Recommender System with Low-Latency Model Update. In USENIX OSDI.

PDF

(2022). A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning. In NeurIPS.

PDF

(2022). MegBA: A GPU-Based Distributed Library for Large-Scale Bundle Adjustment. In ECCV.

PDF Code

Software

ServerlessLLM

Serverless LLM serving for everyone. GitHub stars

MegBA

A GPU-Based Distributed Library for Large-Scale Bundle Adjustment. GitHub stars

TorchOpt

An efficient library for differentiable optimization built upon PyTorch. GitHub stars

Quiver

PyTorch Library for Low-Latency, High-Throughput Graph Learning on GPUs. GitHub stars

KungFu

Adaptive Large-scale Deep Learning GitHub stars

TensorLayer

Easy-to-use Deep Learning Library GitHub stars

HyperPose

Real-time Visual Computing Library GitHub stars

RLzoo

Reinforcement Learning Model Zoo GitHub stars

Group

Researchers

Xuan Sun

Research Associate

Cheng Deng

Research Fellow

Haocheng Xiao

Research Associate

Grad Students

Leyang Xue

PhD Student (Primary supervisor Mahesh Marina)

Man-Kit Sit

PhD Student

Yao Fu

PhD Student

Congjie He

PhD Student

Yeqi Huang

PhD Student

Maria Durackova

PhD Student (Primary supervisor Boris Grot)

Matej Sandor

PhD Student

Dayou Du

PhD Student (Primary supervisor Jianyi Cheng)

Zhan Lu

PhD Student

Yi-Chieh Wang

PhD Student

Yinsicheng Jiang

MScR Student

Tairan Xu

PhD Student

Yangsheng Deng

PhD Student

Teaching

Course designer and organizer for popular courses (150+ students) at Edinburgh:

Service and Awards

Selected Research Awards and Grants

  • Winner of AI for Math fund (Renaissance Philanthropy), 2025
  • ARIA project for benchmarking AI evolution, 2025
  • ARIA project for scaling AI compute, 2024
  • Microsoft Research Asia StarTrack Scholar Award, 2024
  • EPSRC CDT for ML Systems, 2024
  • Chancellor Rising Star in Research (Finalist), 2023
  • Tencent Research Award, 2022
  • Alibaba Innovative Research Award, 2020
  • Microsoft Azure Research Award, 2018
  • ACM Multimedia Best Open-Source Software Award, 2017
  • Google PhD Fellowship in Cloud Computing, 2012 - 2016
  • ACM CoNEXT Conference Best Paper Finalist, 2014
  • IEEE MASS Conference Best Paper Finalist, 2012

Conference Organization:

Selected Conference Committee Memberships:

  • ASPLOS (2026)
  • EuroSys (2025)
  • ICDE (2021-2025)
  • SoCC (2023-2025)
  • MICRO (2022)

Contact

  • luo.mai@ed.ac.uk
  • IF-2.03, Informatics Forum, University of Edinburgh, Edinburgh, EH8 9AB