Picture for Wuhui Chen

Wuhui Chen

Accurate Expert Predictions in MoE Inference via Cross-Layer Gate

Add code
Feb 17, 2025
Viaarxiv icon

Klotski: Efficient Mixture-of-Expert Inference via Expert-Aware Multi-Batch Pipeline

Add code
Feb 09, 2025
Viaarxiv icon

Training and Serving System of Foundation Models: A Comprehensive Survey

Add code
Jan 05, 2024
Viaarxiv icon

Intelligence-Endogenous Management Platform for Computing and Network Convergence

Add code
Aug 07, 2023
Figure 1 for Intelligence-Endogenous Management Platform for Computing and Network Convergence
Figure 2 for Intelligence-Endogenous Management Platform for Computing and Network Convergence
Figure 3 for Intelligence-Endogenous Management Platform for Computing and Network Convergence
Figure 4 for Intelligence-Endogenous Management Platform for Computing and Network Convergence
Viaarxiv icon