Picture for Luoming Zhang

Luoming Zhang

ZipCache: Accurate and Efficient KV Cache Quantization with Salient Token Identification

Add code
May 23, 2024
Viaarxiv icon

Towards Accurate Post-training Quantization for Reparameterized Models

Add code
Feb 25, 2024
Viaarxiv icon

DSText V2: A Comprehensive Video Text Spotting Dataset for Dense and Small Text

Add code
Nov 29, 2023
Viaarxiv icon

Dual Grained Quantization: Efficient Fine-Grained Quantization for LLM

Add code
Oct 07, 2023
Viaarxiv icon

BiViT: Extremely Compressed Binary Vision Transformer

Add code
Nov 14, 2022
Viaarxiv icon

Binarizing by Classification: Is soft function really necessary?

Add code
May 16, 2022
Figure 1 for Binarizing by Classification: Is soft function really necessary?
Figure 2 for Binarizing by Classification: Is soft function really necessary?
Figure 3 for Binarizing by Classification: Is soft function really necessary?
Figure 4 for Binarizing by Classification: Is soft function really necessary?
Viaarxiv icon

Data-Free Quantization with Accurate Activation Clipping and Adaptive Batch Normalization

Add code
Apr 08, 2022
Figure 1 for Data-Free Quantization with Accurate Activation Clipping and Adaptive Batch Normalization
Figure 2 for Data-Free Quantization with Accurate Activation Clipping and Adaptive Batch Normalization
Figure 3 for Data-Free Quantization with Accurate Activation Clipping and Adaptive Batch Normalization
Figure 4 for Data-Free Quantization with Accurate Activation Clipping and Adaptive Batch Normalization
Viaarxiv icon