Dan Alistarh

Towards Combinatorial Interpretability of Neural Computation

Apr 10, 2025
Via arXiv

Hogwild! Inference: Parallel LLM Generation via Concurrent Attention

Apr 09, 2025
Via arXiv

Compression Scaling Laws: Unifying Sparsity and Quantization

Feb 23, 2025
Via arXiv

DarwinLM: Evolutionary Structured Pruning of Large Language Models

Feb 11, 2025
Via arXiv

Position: It's Time to Act on the Risk of Efficient Personalized Text Generation

Feb 10, 2025
Via arXiv

QuEST: Stable Training of LLMs with 1-Bit Weights and Activations

Feb 07, 2025
Via arXiv

Cache Me If You Must: Adaptive Key-Value Quantization for Large Language Models

Jan 31, 2025
Via arXiv

HALO: Hadamard-Assisted Lossless Optimization for Efficient Low-Precision LLM Training and Fine-Tuning

Jan 05, 2025
Via arXiv

Pushing the Limits of Large Language Model Quantization via the Linearity Theorem

Nov 26, 2024
Via arXiv

"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization

Nov 04, 2024
Via arXiv