Picture for Zixian Zhu

Zixian Zhu

GWQ: Gradient-Aware Weight Quantization for Large Language Models

Add code
Oct 30, 2024
Viaarxiv icon