Picture for Cong Xie

Cong Xie

SDP4Bit: Toward 4-bit Communication Quantization in Sharded Data Parallelism for LLM Training

Add code
Oct 20, 2024
Figure 1 for SDP4Bit: Toward 4-bit Communication Quantization in Sharded Data Parallelism for LLM Training
Figure 2 for SDP4Bit: Toward 4-bit Communication Quantization in Sharded Data Parallelism for LLM Training
Figure 3 for SDP4Bit: Toward 4-bit Communication Quantization in Sharded Data Parallelism for LLM Training
Figure 4 for SDP4Bit: Toward 4-bit Communication Quantization in Sharded Data Parallelism for LLM Training
Viaarxiv icon

MoE-Pruner: Pruning Mixture-of-Experts Large Language Model using the Hints from Its Router

Add code
Oct 15, 2024
Figure 1 for MoE-Pruner: Pruning Mixture-of-Experts Large Language Model using the Hints from Its Router
Figure 2 for MoE-Pruner: Pruning Mixture-of-Experts Large Language Model using the Hints from Its Router
Figure 3 for MoE-Pruner: Pruning Mixture-of-Experts Large Language Model using the Hints from Its Router
Figure 4 for MoE-Pruner: Pruning Mixture-of-Experts Large Language Model using the Hints from Its Router
Viaarxiv icon

MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs

Add code
Feb 23, 2024
Figure 1 for MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs
Figure 2 for MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs
Figure 3 for MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs
Figure 4 for MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs
Viaarxiv icon

Object-Driven One-Shot Fine-tuning of Text-to-Image Diffusion with Prototypical Embedding

Add code
Jan 28, 2024
Viaarxiv icon

LEMON: Lossless model expansion

Add code
Oct 12, 2023
Figure 1 for LEMON: Lossless model expansion
Figure 2 for LEMON: Lossless model expansion
Figure 3 for LEMON: Lossless model expansion
Figure 4 for LEMON: Lossless model expansion
Viaarxiv icon

Baechi: Fast Device Placement of Machine Learning Graphs

Add code
Jan 20, 2023
Figure 1 for Baechi: Fast Device Placement of Machine Learning Graphs
Figure 2 for Baechi: Fast Device Placement of Machine Learning Graphs
Figure 3 for Baechi: Fast Device Placement of Machine Learning Graphs
Figure 4 for Baechi: Fast Device Placement of Machine Learning Graphs
Viaarxiv icon

Learning Shape Priors by Pairwise Comparison for Robust Semantic Segmentation

Add code
Apr 23, 2022
Figure 1 for Learning Shape Priors by Pairwise Comparison for Robust Semantic Segmentation
Figure 2 for Learning Shape Priors by Pairwise Comparison for Robust Semantic Segmentation
Viaarxiv icon

RECIST-Net: Lesion detection via grouping keypoints on RECIST-based annotation

Add code
Jul 19, 2021
Figure 1 for RECIST-Net: Lesion detection via grouping keypoints on RECIST-based annotation
Figure 2 for RECIST-Net: Lesion detection via grouping keypoints on RECIST-based annotation
Figure 3 for RECIST-Net: Lesion detection via grouping keypoints on RECIST-based annotation
Figure 4 for RECIST-Net: Lesion detection via grouping keypoints on RECIST-based annotation
Viaarxiv icon

Compressed Communication for Distributed Training: Adaptive Methods and System

Add code
May 17, 2021
Figure 1 for Compressed Communication for Distributed Training: Adaptive Methods and System
Figure 2 for Compressed Communication for Distributed Training: Adaptive Methods and System
Figure 3 for Compressed Communication for Distributed Training: Adaptive Methods and System
Figure 4 for Compressed Communication for Distributed Training: Adaptive Methods and System
Viaarxiv icon

Visual Steering for One-Shot Deep Neural Network Synthesis

Add code
Sep 28, 2020
Figure 1 for Visual Steering for One-Shot Deep Neural Network Synthesis
Figure 2 for Visual Steering for One-Shot Deep Neural Network Synthesis
Figure 3 for Visual Steering for One-Shot Deep Neural Network Synthesis
Figure 4 for Visual Steering for One-Shot Deep Neural Network Synthesis
Viaarxiv icon