Picture for Dan Alistarh

Dan Alistarh

"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization

Add code
Nov 04, 2024
Viaarxiv icon

LDAdam: Adaptive Optimization from Low-Dimensional Gradient Statistics

Add code
Oct 21, 2024
Viaarxiv icon

EvoPress: Towards Optimal Dynamic Model Compression via Evolutionary Search

Add code
Oct 18, 2024
Viaarxiv icon

Scalable Mechanistic Neural Networks

Add code
Oct 08, 2024
Viaarxiv icon

Accurate Compression of Text-to-Image Diffusion Models via Vector Quantization

Add code
Aug 31, 2024
Viaarxiv icon

The Iterative Optimal Brain Surgeon: Faster Sparse Recovery by Leveraging Second-Order Information

Add code
Aug 30, 2024
Viaarxiv icon

MARLIN: Mixed-Precision Auto-Regressive Parallel Inference on Large Language Models

Add code
Aug 21, 2024
Viaarxiv icon

Mathador-LM: A Dynamic Benchmark for Mathematical Reasoning on Large Language Models

Add code
Jun 18, 2024
Viaarxiv icon

Sparse Expansion and Neuronal Disentanglement

Add code
May 24, 2024
Viaarxiv icon

MicroAdam: Accurate Adaptive Optimization with Low Space Overhead and Provable Convergence

Add code
May 24, 2024
Viaarxiv icon