Picture for Yanhan Li

Yanhan Li

MoE$^2$: Optimizing Collaborative Inference for Edge Large Language Models

Add code
Jan 16, 2025
Viaarxiv icon