Picture for Banglan Liu

Banglan Liu

MoNTA: Accelerating Mixture-of-Experts Training with Network-Traffc-Aware Parallel Optimization

Add code
Nov 01, 2024
Viaarxiv icon