Picture for Yuezhou Hu

Yuezhou Hu

S-STE: Continuous Pruning Function for Efficient 2:4 Sparse Pre-training

Add code
Sep 13, 2024
Figure 1 for S-STE: Continuous Pruning Function for Efficient 2:4 Sparse Pre-training
Figure 2 for S-STE: Continuous Pruning Function for Efficient 2:4 Sparse Pre-training
Figure 3 for S-STE: Continuous Pruning Function for Efficient 2:4 Sparse Pre-training
Figure 4 for S-STE: Continuous Pruning Function for Efficient 2:4 Sparse Pre-training
Viaarxiv icon

Pruning Large Language Models with Semi-Structural Adaptive Sparse Training

Add code
Jul 30, 2024
Viaarxiv icon

Accelerating Transformer Pre-Training with 2:4 Sparsity

Add code
Apr 02, 2024
Viaarxiv icon