Picture for Yufeng Lyu

Yufeng Lyu

Accurate Expert Predictions in MoE Inference via Cross-Layer Gate

Add code
Feb 17, 2025
Figure 1 for Accurate Expert Predictions in MoE Inference via Cross-Layer Gate
Figure 2 for Accurate Expert Predictions in MoE Inference via Cross-Layer Gate
Figure 3 for Accurate Expert Predictions in MoE Inference via Cross-Layer Gate
Figure 4 for Accurate Expert Predictions in MoE Inference via Cross-Layer Gate
Viaarxiv icon

Klotski: Efficient Mixture-of-Expert Inference via Expert-Aware Multi-Batch Pipeline

Add code
Feb 09, 2025
Viaarxiv icon