Picture for Minxi Yan

Minxi Yan

GWQ: Gradient-Aware Weight Quantization for Large Language Models

Add code
Oct 30, 2024
Viaarxiv icon