Jinyang Guo

TCAQ-DM: Timestep-Channel Adaptive Quantization for Diffusion Models

Dec 21, 2024
Via arXiv

PTSBench: A Comprehensive Post-Training Sparsity Benchmark Towards Algorithms and Models

Dec 10, 2024
Via arXiv

BiDM: Pushing the Limit of Quantization for Diffusion Models

Dec 08, 2024
Via arXiv

LLMCBench: Benchmarking Large Language Model Compression for Efficient Deployment

Oct 28, 2024
Via arXiv

HarmoniCa: Harmonizing Training and Inference for Better Feature Cache in Diffusion Transformer Acceleration

Oct 02, 2024
Via arXiv

A Survey of Low-bit Large Language Models: Basics, Systems, and Algorithms

Sep 25, 2024
Via arXiv

DDK: Distilling Domain Knowledge for Efficient Large Language Models

Jul 23, 2024
Via arXiv

QVD: Post-training Quantization for Video Diffusion Models

Jul 16, 2024
Via arXiv

Fast and Controllable Post-training Sparsity: Learning Optimal Sparsity Allocation with Global Constraint in Minutes

May 09, 2024
Via arXiv

PTQ4SAM: Post-Training Quantization for Segment Anything

May 06, 2024
Via arXiv