Picture for Yuxuan Yue

Yuxuan Yue

WKVQuant: Quantizing Weight and Key/Value Cache for Large Language Models Gains More

Add code
Feb 20, 2024
Viaarxiv icon