Luo Mai

Luo Mai

Assistant Professor

University of Edinburgh

About Me

I am an Assistant Professor in the School of Informatics at the University of Edinburgh, where I lead the Edinburgh Large-Scale Machine Learning Systems Group. Starting in 2024, I also co-lead the UK EPSRC Centre for Doctoral Training in Machine Learning Systems and an ARIA project focused on modeling and scaling AI systems.

My research interests lie at the intersection of computer systems, machine learning, and data management. My work has resulted in award-winning computer systems, recognized at top conferences such as OSDI, SOSP, NSDI, VLDB, JMLR, ICML, NeurIPS and ECCV. I have received awards from Google, Microsoft, Alibaba, and Tencent. I am the author of the open-source textbook Machine Learning Systems: Design and Implementation and co-founder of several popular open-source projects, such as TensorLayer, TorchOpt, and ServerlessLLM.

Before joining Edinburgh, I was a research associate at Imperial College London, working with Peter Pietzuch and a visiting researcher at Microsoft Research. My PhD, supervised by Paolo Costa and Alexander L. Wolf, was supported by a Google Fellowship in Cloud Computing.

If you’re interested in pursuing a PhD or postdoctoral research with me, please email your CV along with a brief description of your proposed research.

Interests

  • Computer Systems
  • Machine Learning
  • Data Management

Education

  • PhD in Computer Science, 2018

    Imperial College London, UK

  • MRes in Advanced Computing, 2012

    Imperial College London, UK

Publications

(2024). Tenplex: Dynamic Parallelism for Deep Learning using Parallelizable Tensor Collections. In SOSP.

PDF

(2024). ServerlessLLM: Locality-Enhanced Serverless Inference for Large Language Models. In OSDI.

PDF Code

(2023). TorchOpt: An Efficient Library for Differentiable Optimization. In JMLR.

PDF Code

(2023). GEAR: A GPU-Centric Experience Replay System for Large Reinforcement Learning Models. In ICML.

PDF Code

(2022). Ekko: A Large-Scale Deep Learning Recommender System with Low-Latency Model Update. In USENIX OSDI.

PDF

(2022). A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning. In NeurIPS.

PDF

(2022). MegBA: A GPU-Based Distributed Library for Large-Scale Bundle Adjustment. In ECCV.

PDF Code

(2021). Move Fast and Meet Deadlines: Fine-grained Real-time Stream Processing with Cameo. In USENIX NSDI.

PDF

(2021). Efficient Reinforcement Learning Development with RLzoo. In ACM Multimedia (Open-source Software Competition).

PDF Code

(2021). Fast and Flexible Human Pose Estimation with HyperPose. In ACM Multimedia (Open-source Software Competition).

PDF Code

Software

ServerlessLLM

Serverless LLM serving for everyone. GitHub stars

MegBA

A GPU-Based Distributed Library for Large-Scale Bundle Adjustment. GitHub stars

TorchOpt

An efficient library for differentiable optimization built upon PyTorch. GitHub stars

Quiver

PyTorch Library for Low-Latency, High-Throughput Graph Learning on GPUs. GitHub stars

KungFu

Adaptive Large-scale Deep Learning GitHub stars

TensorLayer

Easy-to-use Deep Learning Library GitHub stars

HyperPose

Real-time Visual Computing Library GitHub stars

RLzoo

Reinforcement Learning Model Zoo GitHub stars

Group

Researchers

Xuan Sun

Research Associate

Grad Students

Leyang Xue

PhD Student (Primary supervisor Mahesh Marina)

Yao Fu

PhD Student (Winner of 2024 Rising Star in ML & Systems)

Man-Kit Sit

PhD Student

Congjie He

PhD Student

Yeqi Huang

PhD Student

Maria Durackova

PhD Student (Primary supervisor Boris Grot)

Matej Sandor

PhD Student

Teaching

Course designer and organizer for popular courses (150+ students) at Edinburgh:

Service and Awards

Selected Research Awards and Grants

  • ARIA project for scaling AI compute, 2024
  • Microsoft Research Asia StarTrack Scholar Award, 2024
  • EPSRC CDT for ML Systems, 2024
  • Chancellor Rising Star in Research (Finalist), 2023
  • Tencent Research Award, 2022
  • Alibaba Innovative Research Award, 2020
  • Microsoft Azure Research Award, 2018
  • ACM Multimedia Best Open-Source Software Award, 2017
  • Google PhD Fellowship in Cloud Computing, 2012 - 2016
  • ACM CoNEXT Conference Best Paper Finalist, 2014
  • IEEE MASS Conference Best Paper Finalist, 2012

Conference Organization:

Selected Conference Committee Memberships:

  • EuroSys (2025)
  • ICDE (2021-2025)
  • SoCC (2023-2024)
  • MICRO (2022)

Contact

  • luo.mai@ed.ac.uk
  • IF-2.03, Informatics Forum, University of Edinburgh, Edinburgh, EH8 9AB