Picture for Zelei Shao

Zelei Shao

MiLo: Efficient Quantized MoE Inference with Mixture of Low-Rank Compensators

Add code
Apr 03, 2025
Viaarxiv icon