Picture for Wencong Xiao

Wencong Xiao

Boosting Large-scale Parallel Training Efficiency with C4: A Communication-Driven Approach

Add code
Jun 07, 2024
Figure 1 for Boosting Large-scale Parallel Training Efficiency with C4: A Communication-Driven Approach
Figure 2 for Boosting Large-scale Parallel Training Efficiency with C4: A Communication-Driven Approach
Figure 3 for Boosting Large-scale Parallel Training Efficiency with C4: A Communication-Driven Approach
Figure 4 for Boosting Large-scale Parallel Training Efficiency with C4: A Communication-Driven Approach
Viaarxiv icon

Llumnix: Dynamic Scheduling for Large Language Model Serving

Add code
Jun 05, 2024
Figure 1 for Llumnix: Dynamic Scheduling for Large Language Model Serving
Figure 2 for Llumnix: Dynamic Scheduling for Large Language Model Serving
Figure 3 for Llumnix: Dynamic Scheduling for Large Language Model Serving
Figure 4 for Llumnix: Dynamic Scheduling for Large Language Model Serving
Viaarxiv icon

FusionAD: Multi-modality Fusion for Prediction and Planning Tasks of Autonomous Driving

Add code
Aug 14, 2023
Viaarxiv icon

MIGPerf: A Comprehensive Benchmark for Deep Learning Training and Inference Workloads on Multi-Instance GPUs

Add code
Jan 01, 2023
Viaarxiv icon

Focusing More on Conflicts with Mis-Predictions Helps Language Pre-Training

Add code
Dec 16, 2020
Figure 1 for Focusing More on Conflicts with Mis-Predictions Helps Language Pre-Training
Figure 2 for Focusing More on Conflicts with Mis-Predictions Helps Language Pre-Training
Figure 3 for Focusing More on Conflicts with Mis-Predictions Helps Language Pre-Training
Figure 4 for Focusing More on Conflicts with Mis-Predictions Helps Language Pre-Training
Viaarxiv icon

Balanced Sparsity for Efficient DNN Inference on GPU

Add code
Nov 02, 2018
Figure 1 for Balanced Sparsity for Efficient DNN Inference on GPU
Figure 2 for Balanced Sparsity for Efficient DNN Inference on GPU
Figure 3 for Balanced Sparsity for Efficient DNN Inference on GPU
Figure 4 for Balanced Sparsity for Efficient DNN Inference on GPU
Viaarxiv icon