Ruidong Zhu

MegaScale-Infer: Serving Mixture-of-Experts at Scale with Disaggregated Expert Parallelism

Apr 03, 2025
Via arXiv