Picture for Yifu Ding

Yifu Ding

LLMCBench: Benchmarking Large Language Model Compression for Efficient Deployment

Add code
Oct 28, 2024
Figure 1 for LLMCBench: Benchmarking Large Language Model Compression for Efficient Deployment
Figure 2 for LLMCBench: Benchmarking Large Language Model Compression for Efficient Deployment
Figure 3 for LLMCBench: Benchmarking Large Language Model Compression for Efficient Deployment
Figure 4 for LLMCBench: Benchmarking Large Language Model Compression for Efficient Deployment
Viaarxiv icon

A Survey of Low-bit Large Language Models: Basics, Systems, and Algorithms

Add code
Sep 25, 2024
Viaarxiv icon

PTQ4SAM: Post-Training Quantization for Segment Anything

Add code
May 06, 2024
Viaarxiv icon

DB-LLM: Accurate Dual-Binarization for Efficient LLMs

Add code
Feb 19, 2024
Figure 1 for DB-LLM: Accurate Dual-Binarization for Efficient LLMs
Figure 2 for DB-LLM: Accurate Dual-Binarization for Efficient LLMs
Figure 3 for DB-LLM: Accurate Dual-Binarization for Efficient LLMs
Figure 4 for DB-LLM: Accurate Dual-Binarization for Efficient LLMs
Viaarxiv icon

OHQ: On-chip Hardware-aware Quantization

Add code
Sep 05, 2023
Viaarxiv icon

Towards Accurate Post-Training Quantization for Vision Transformer

Add code
Mar 25, 2023
Viaarxiv icon

BiBench: Benchmarking and Analyzing Network Binarization

Add code
Jan 26, 2023
Figure 1 for BiBench: Benchmarking and Analyzing Network Binarization
Figure 2 for BiBench: Benchmarking and Analyzing Network Binarization
Figure 3 for BiBench: Benchmarking and Analyzing Network Binarization
Figure 4 for BiBench: Benchmarking and Analyzing Network Binarization
Viaarxiv icon

BiFSMNv2: Pushing Binary Neural Networks for Keyword Spotting to Real-Network Performance

Add code
Nov 13, 2022
Viaarxiv icon

BiBERT: Accurate Fully Binarized BERT

Add code
Mar 12, 2022
Figure 1 for BiBERT: Accurate Fully Binarized BERT
Figure 2 for BiBERT: Accurate Fully Binarized BERT
Figure 3 for BiBERT: Accurate Fully Binarized BERT
Figure 4 for BiBERT: Accurate Fully Binarized BERT
Viaarxiv icon

BiFSMN: Binary Neural Network for Keyword Spotting

Add code
Feb 15, 2022
Figure 1 for BiFSMN: Binary Neural Network for Keyword Spotting
Figure 2 for BiFSMN: Binary Neural Network for Keyword Spotting
Figure 3 for BiFSMN: Binary Neural Network for Keyword Spotting
Figure 4 for BiFSMN: Binary Neural Network for Keyword Spotting
Viaarxiv icon