Xinran Hong

LSAQ: Layer-Specific Adaptive Quantization for Large Language Model Deployment

Dec 24, 2024
Via arXiv