Picture for Zhengao Li

Zhengao Li

ToMoE: Converting Dense Large Language Models to Mixture-of-Experts through Dynamic Structural Pruning

Add code
Jan 25, 2025
Viaarxiv icon