About

Research identity first

I'm an incoming PhD student in Computer Science at Princeton University, starting in Fall 2026, where I will be advised by Tri Dao and Ravi Netravali. I previously worked as a Visiting Student Researcher at the Sky Computing Lab at UC Berkeley, advised by Ion Stoica, and also collaborated with Joseph E. Gonzalez and Matei Zaharia. I completed my B.S. in Computer Science (Turing Honors Class) at Renmin University of China.

I work broadly on machine learning systems, with interests in efficient infrastructure, training and inference systems, and the design of systems for emerging AI workloads.

Agenda

Research agenda

  • Machine learning systems
  • Training and inference infrastructure
  • Efficient systems for AI workloads
  • Systems support for emerging AI applications

Builder lens

Systems ideas into tools people use

Alongside research, I translate ML systems ideas into infrastructure and tools that people actually use — from a high-availability controller running in production to a vector index with tens of thousands of users.

On SkyPilot, I built the High Availability Controller for SkyServe — the layer that keeps model serving online across node and region failures — now adopted by startups including Hypermode. On LEANN, I led the push from research prototype to a low-storage vector index that runs on personal devices, with tens of thousands of downloads and a Best Paper Award at MLSys 2026.