Picture for Michael Goin

Michael Goin

Accurate Compression of Text-to-Image Diffusion Models via Vector Quantization

Add code
Aug 31, 2024
Viaarxiv icon

Enabling High-Sparsity Foundational Llama Models with Efficient Pretraining and Deployment

Add code
May 06, 2024
Viaarxiv icon

Sparse Fine-tuning for Inference Acceleration of Large Language Models

Add code
Oct 13, 2023
Viaarxiv icon

The Optimal BERT Surgeon: Scalable and Accurate Second-Order Pruning for Large Language Models

Add code
Mar 14, 2022
Figure 1 for The Optimal BERT Surgeon: Scalable and Accurate Second-Order Pruning for Large Language Models
Figure 2 for The Optimal BERT Surgeon: Scalable and Accurate Second-Order Pruning for Large Language Models
Figure 3 for The Optimal BERT Surgeon: Scalable and Accurate Second-Order Pruning for Large Language Models
Figure 4 for The Optimal BERT Surgeon: Scalable and Accurate Second-Order Pruning for Large Language Models
Viaarxiv icon