Picture for Dan Alistarh

Dan Alistarh

DarwinLM: Evolutionary Structured Pruning of Large Language Models

Add code
Feb 11, 2025
Viaarxiv icon

Position: It's Time to Act on the Risk of Efficient Personalized Text Generation

Add code
Feb 10, 2025
Viaarxiv icon

QuEST: Stable Training of LLMs with 1-Bit Weights and Activations

Add code
Feb 07, 2025
Viaarxiv icon

Cache Me If You Must: Adaptive Key-Value Quantization for Large Language Models

Add code
Jan 31, 2025
Viaarxiv icon

HALO: Hadamard-Assisted Lossless Optimization for Efficient Low-Precision LLM Training and Fine-Tuning

Add code
Jan 05, 2025
Figure 1 for HALO: Hadamard-Assisted Lossless Optimization for Efficient Low-Precision LLM Training and Fine-Tuning
Figure 2 for HALO: Hadamard-Assisted Lossless Optimization for Efficient Low-Precision LLM Training and Fine-Tuning
Figure 3 for HALO: Hadamard-Assisted Lossless Optimization for Efficient Low-Precision LLM Training and Fine-Tuning
Figure 4 for HALO: Hadamard-Assisted Lossless Optimization for Efficient Low-Precision LLM Training and Fine-Tuning
Viaarxiv icon

Pushing the Limits of Large Language Model Quantization via the Linearity Theorem

Add code
Nov 26, 2024
Figure 1 for Pushing the Limits of Large Language Model Quantization via the Linearity Theorem
Figure 2 for Pushing the Limits of Large Language Model Quantization via the Linearity Theorem
Figure 3 for Pushing the Limits of Large Language Model Quantization via the Linearity Theorem
Figure 4 for Pushing the Limits of Large Language Model Quantization via the Linearity Theorem
Viaarxiv icon

"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization

Add code
Nov 04, 2024
Figure 1 for "Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization
Figure 2 for "Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization
Figure 3 for "Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization
Figure 4 for "Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization
Viaarxiv icon

LDAdam: Adaptive Optimization from Low-Dimensional Gradient Statistics

Add code
Oct 21, 2024
Viaarxiv icon

EvoPress: Towards Optimal Dynamic Model Compression via Evolutionary Search

Add code
Oct 18, 2024
Viaarxiv icon

Scalable Mechanistic Neural Networks

Add code
Oct 08, 2024
Figure 1 for Scalable Mechanistic Neural Networks
Figure 2 for Scalable Mechanistic Neural Networks
Figure 3 for Scalable Mechanistic Neural Networks
Figure 4 for Scalable Mechanistic Neural Networks
Viaarxiv icon