Picture for Jianfei Chen

Jianfei Chen

Accurate INT8 Training Through Dynamic Block-Level Fallback

Add code
Mar 11, 2025
Viaarxiv icon

Oscillation-Reduced MXFP4 Training for Vision Transformers

Add code
Feb 28, 2025
Viaarxiv icon

SpargeAttn: Accurate Sparse Attention Accelerating Any Model Inference

Add code
Feb 25, 2025
Viaarxiv icon

Elucidating the Preconditioning in Consistency Distillation

Add code
Feb 05, 2025
Viaarxiv icon

Sparse VideoGen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity

Add code
Feb 03, 2025
Figure 1 for Sparse VideoGen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity
Figure 2 for Sparse VideoGen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity
Figure 3 for Sparse VideoGen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity
Figure 4 for Sparse VideoGen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity
Viaarxiv icon

Visual Generation Without Guidance

Add code
Jan 26, 2025
Figure 1 for Visual Generation Without Guidance
Figure 2 for Visual Generation Without Guidance
Figure 3 for Visual Generation Without Guidance
Figure 4 for Visual Generation Without Guidance
Viaarxiv icon

ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing

Add code
Dec 19, 2024
Viaarxiv icon

SageAttention2 Technical Report: Accurate 4 Bit Attention for Plug-and-play Inference Acceleration

Add code
Nov 17, 2024
Figure 1 for SageAttention2 Technical Report: Accurate 4 Bit Attention for Plug-and-play Inference Acceleration
Figure 2 for SageAttention2 Technical Report: Accurate 4 Bit Attention for Plug-and-play Inference Acceleration
Figure 3 for SageAttention2 Technical Report: Accurate 4 Bit Attention for Plug-and-play Inference Acceleration
Figure 4 for SageAttention2 Technical Report: Accurate 4 Bit Attention for Plug-and-play Inference Acceleration
Viaarxiv icon

Consistency Diffusion Bridge Models

Add code
Oct 31, 2024
Figure 1 for Consistency Diffusion Bridge Models
Figure 2 for Consistency Diffusion Bridge Models
Figure 3 for Consistency Diffusion Bridge Models
Figure 4 for Consistency Diffusion Bridge Models
Viaarxiv icon

COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 Training

Add code
Oct 25, 2024
Figure 1 for COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 Training
Figure 2 for COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 Training
Figure 3 for COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 Training
Figure 4 for COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 Training
Viaarxiv icon