Zewen Jin

BigMac: A Communication-Efficient Mixture-of-Experts Model Structure for Fast Training and Inference

Feb 24, 2025
Via arXiv