Tao Luo

I am a final-year CS Ph.D. candidate at the University of Pennsylvania advised by Profs. Boon Thau Loo and Vincent Liu.

My research advances GPU scheduling for AI infrastructure, focusing on agentic reinforcement learning (RL) and LLM serving. I enabled GPU-sharing for concurrent RL jobs (supporting multi-LoRA and full fine-tuning) in the ROLL framework during my research internship at Alibaba.

Previously at Columbia University (M.S.), I coined privacy budget scheduling, leading the first study on scheduling ML training under differential privacy constraints. I was advised by Prof. Asaf Cidon and collaborated broadly with Profs. Ethan Katz-Bassett, Ryan Stutsman, Mathias Lécuyer, and Roxana Geambasu.

Before academia, I developed quantitative investment algorithms in the financial industry. I hold a B.S. in Financial Mathematics from Southern University of Science and Technology, as a member of its founding cohort.

Selected Projects

GPU Scheduling for Agentic RL @Alibaba

  • Proposed a Partial Time-Sharing GPU scheduling algorithm for agentic RL job.
  • Extended the scheduling logic and enabled multi-LoRA training.
  • Redesigned the architecture and scheduling logic for multi-tenant full fine-tuning.
  • Deployed in production (100B+ parameters, 1000+ GPUs): Amap (travel planning), iFlow CLI (coding), Qoder IDE (coding), Alimama (ads).
  • Developed an AI-assisted coding methodology for software development (English/Chinese).

GPU Multiplexing for Heterogeneous LLM Serving @UPenn

  • Eliminated head-of-line blocking via novel LLM serving architecture, raising token throughput by 1.6×.
  • Built efficient multi-model KV cache management and robust NCCL concurrency controls.
  • Optimized sharding, replication, placement, and scheduling strategies.
  • SoCC’25 paper

Privacy Budget Scheduling in ML Training @Columbia

  • Scheduled more jobs than FCFS under identical privacy budgets.
  • Proposed a dynamic algorithm DPF (Dominant Private Block Fairness) based on DRF (dominant resource fairness).
  • Developed rigorous proofs for the game-theory properties of the new algorithm.
  • OSDI’21 paper

Honors & Service

  • Program Committee: ACM Symposium on Cloud Computing 2025
  • Manjushri Fellowship, University of Pennsylvania, 2021
  • Financial Risk Manager (FRM) Certification, 2015
  • China Merchant Bank Scholarship, 2012-2014
  • Pioneering Undergraduate Fellowship, 2011-2014
  • First Prize, China High School Biology Olympiad, 2010