Luo Mai

Luo Mai

Assistant Professor

University of Edinburgh

About Me

I am an Assistant Professor in the School of Informatics at the University of Edinburgh, leading the Large-Scale Machine Learning Systems Group and co-directing the UK EPSRC Centre for Doctoral Training in Machine Learning Systems.

My research focuses on creating scalable, efficient, and reliable computer systems for machine learning and data management. This work has been recognized at major conferences such as OSDI, USENIX ATC, NSDI, CoNEXT, JMLR, ICML, NeurIPS, ECCV, and VLDB, earning awards from Google, Microsoft, Alibaba, and Tencent. I authored the open-source textbook Machine Learning Systems: Design and Implementation and co-founded several popular open-source projects including TensorLayer, HyperPose, TorchOpt and ServerlessLLM.

Prior to Edinburgh, I served as a research associate at Imperial College London, working under Peter Pietzuch and was a visiting researcher at Microsoft Research. My PhD, supervised by Paolo Costa and Alexander L. Wolf, was supported by a Google Fellowship in Cloud Computing.

If you are interested in pursuing a PhD or postdoctoral research with me, please send an email with your CV and a brief description of your proposed research.

Interests

  • Computer Systems
  • Machine Learning
  • Data Management

Education

  • PhD in Computer Science, 2018

    Imperial College London, UK

  • MRes in Advanced Computing, 2012

    Imperial College London, UK

Publications

(2024). MoE-Infinity: Activation-Aware Expert Offloading for Efficient MoE Serving. In Arxiv.

PDF Code

(2024). ServerlessLLM: Locality-Enhanced Serverless Inference for Large Language Models. In OSDI.

PDF Code

(2023). TorchOpt: An Efficient Library for Differentiable Optimization. In JMLR.

PDF Code

(2023). GEAR: A GPU-Centric Experience Replay System for Large Reinforcement Learning Models. In ICML.

PDF Code

(2023). Quiver: Supporting GPUs for Low-Latency, High-Throughput GNN Serving with Workload Awareness. In Arxiv.

PDF Code

(2022). Ekko: A Large-Scale Deep Learning Recommender System with Low-Latency Model Update. In USENIX OSDI.

PDF

(2022). A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning. In NeurIPS.

PDF

(2022). MegBA: A GPU-Based Distributed Library for Large-Scale Bundle Adjustment. In ECCV.

PDF Code

(2021). Move Fast and Meet Deadlines: Fine-grained Real-time Stream Processing with Cameo. In USENIX NSDI.

PDF

(2021). Efficient Reinforcement Learning Development with RLzoo. In ACM Multimedia (Open-source Software Competition).

PDF Code

Software

MegBA

A GPU-Based Distributed Library for Large-Scale Bundle Adjustment. GitHub stars

TorchOpt

An efficient library for differentiable optimization built upon PyTorch. GitHub stars

Quiver

PyTorch Library for Low-Latency, High-Throughput Graph Learning on GPUs. GitHub stars

KungFu

Adaptive Large-scale Deep Learning GitHub stars

TensorLayer

Easy-to-use Deep Learning Library GitHub stars

HyperPose

Real-time Visual Computing Library GitHub stars

RLzoo

Reinforcement Learning Model Zoo GitHub stars

Group

Researchers

Xuan Sun

Research Associate (Co-hosted with Boris Grot)

Grad Students

Leyang Xue

PhD Student (Co-supervised with Mahesh Marina)

Yao Fu

PhD Student

Man-Kit Sit

PhD Student

Congjie He

PhD Student

Yeqi Huang

PhD Student (Co-supervised with Boris Grot)

Teaching

Service and Awards

Research Awards

  • Microsoft Research Asia StarTrack Scholar Award, 2024
  • Chancellor Rising Star in Research (Nominated), 2023
  • Tencent Research Award, 2022
  • Alibaba Innovative Research Award, 2020
  • Microsoft Azure Research Award, 2018
  • ACM Multimedia Best Open-Source Software Award, 2017
  • Google PhD Fellowship in Cloud Computing, 2012 - 2016
  • ACM CoNEXT Conference Best Paper Finalist, 2014
  • IEEE MASS Conference Best Paper Finalist, 2012

Conference Committee Member: EuroSys (2025), EuroMLSys (2024), ICDCS (2024), SoCC (2023), APSys (2023), ICDE (2021 - 2023), MICRO (2022)

Contact

  • luo.mai@ed.ac.uk
  • IF-2.03, Informatics Forum, University of Edinburgh, Edinburgh, EH8 9AB