Picture for Lancheng Zou

Lancheng Zou

CMoE: Fast Carving of Mixture-of-Experts for Efficient LLM Inference

Add code
Feb 06, 2025
Viaarxiv icon

MixPE: Quantization and Hardware Co-design for Efficient LLM Inference

Add code
Nov 25, 2024
Viaarxiv icon