Picture for Weilin Cai

Weilin Cai

Capacity-Aware Inference: Mitigating the Straggler Effect in Mixture of Experts

Add code
Mar 07, 2025
Viaarxiv icon

Partial Experts Checkpoint: Efficient Fault Tolerance for Sparse Mixture-of-Experts Model Training

Add code
Aug 08, 2024
Viaarxiv icon

A Survey on Mixture of Experts

Add code
Jun 26, 2024
Viaarxiv icon

Shortcut-connected Expert Parallelism for Accelerating Mixture-of-Experts

Add code
Apr 07, 2024
Viaarxiv icon