Posts

TorchOpt and Gear to appear in NeurIPS and ICML

We have three papers to appear in leading machine learning conferences: “GEAR: A GPU-Centric Experience Replay System for Large Reinforcement Learning Models” to appear in ICML 2023. “A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning” appeared in NeurIPS 2022.

MegBA in ECCV 2022

Our Paper “MegBA: A GPU-Based Distributed Library for Large-Scale Bundle Adjustment” is accepted by ECCV 2022.

Ekko in OSDI 2022

Our Paper “Ekko: A Large-Scale Deep Learning Recommender System with Low-Latency Model Update” is accepted by USENIX Symposium on Operating Systems Design and Implementation (OSDI) 2022. OSDI brings together professionals from academic and industrial backgrounds in what has become a premier forum for discussing the design, implementation, and implications of systems software.

Quiver is open-sourced

Quiver is a distributed graph learning library for PyTorch Geometric (PyG). Its excellent performance and scalability has made it quickly become the recommended distributed library for PyG.

HyperPose and RLzoo in open-source software competition

HyperPose and RLzoo are both accepted to the Open-source Software Competition in ACM Multimedia 2021. ACM Multimedia is the worldwide premier conference and a key world event to display scientific achievements and innovative industrial products in the multimedia field.

Invited lecture at the Oxford ML summer school

I am honored to give a lecture about AI/ML systems at the prestigious Oxford Machine Learning Summer School. The school covers some of the most important topics in ML/DL that the field is showing a growing interest in (e.

Invited talks at TikTok, SenseTime, Tencent and SIGMOD.

I am excited to give talks about future AI/Data systems at TikTok, SenseTime, Tencent and SIGMOD Enterprise Database Workshop 2021.

KungFu to appear at OSDI 2020

Our Paper “KungFu: Making Training in Distributed Machine Learning Adaptive” is accepted by USENIX Symposium on Operating Systems Design and Implementation (OSDI) 2020. OSDI brings together professionals from academic and industrial backgrounds in what has become a premier forum for discussing the design, implementation, and implications of systems software.

Cameo to appear at NSDI 2021

Our Paper “Move Fast and Meet Deadlines: Fine-grained Real-time Stream Processing with Cameo” is accepted by USENIX Symposium on Networked Systems Design and Implementation (NSDI) 2021. NSDI focuses on the design principles, implementation, and practical evaluation of networked and distributed systems.

Invited talk in the AI systems workshop at SOSP 2019

I am invited to give a talk: “Adaptive Distributed Training of Deep Learning Models” in the Workshop on AI Systems at ACM Symposium on Operating Systems Principles (SOSP) 2019.