Picture for Kyuyeun Kim

Kyuyeun Kim

MimiQ: Low-Bit Data-Free Quantization of Vision Transformers

Add code
Jul 29, 2024
Viaarxiv icon

Prefixing Attention Sinks can Mitigate Activation Outliers for Large Language Model Quantization

Add code
Jun 17, 2024
Viaarxiv icon