Picture for Jinyang Guo

Jinyang Guo

APHQ-ViT: Post-Training Quantization with Average Perturbation Hessian Based Reconstruction for Vision Transformers

Add code
Apr 03, 2025
Viaarxiv icon

Efficient Token Compression for Vision Transformer with Spatial Information Preserved

Add code
Mar 30, 2025
Viaarxiv icon

Dynamic Parallel Tree Search for Efficient LLM Reasoning

Add code
Feb 22, 2025
Viaarxiv icon

TCAQ-DM: Timestep-Channel Adaptive Quantization for Diffusion Models

Add code
Dec 21, 2024
Figure 1 for TCAQ-DM: Timestep-Channel Adaptive Quantization for Diffusion Models
Figure 2 for TCAQ-DM: Timestep-Channel Adaptive Quantization for Diffusion Models
Figure 3 for TCAQ-DM: Timestep-Channel Adaptive Quantization for Diffusion Models
Figure 4 for TCAQ-DM: Timestep-Channel Adaptive Quantization for Diffusion Models
Viaarxiv icon

PTSBench: A Comprehensive Post-Training Sparsity Benchmark Towards Algorithms and Models

Add code
Dec 10, 2024
Viaarxiv icon

BiDM: Pushing the Limit of Quantization for Diffusion Models

Add code
Dec 08, 2024
Figure 1 for BiDM: Pushing the Limit of Quantization for Diffusion Models
Figure 2 for BiDM: Pushing the Limit of Quantization for Diffusion Models
Figure 3 for BiDM: Pushing the Limit of Quantization for Diffusion Models
Figure 4 for BiDM: Pushing the Limit of Quantization for Diffusion Models
Viaarxiv icon

LLMCBench: Benchmarking Large Language Model Compression for Efficient Deployment

Add code
Oct 28, 2024
Figure 1 for LLMCBench: Benchmarking Large Language Model Compression for Efficient Deployment
Figure 2 for LLMCBench: Benchmarking Large Language Model Compression for Efficient Deployment
Figure 3 for LLMCBench: Benchmarking Large Language Model Compression for Efficient Deployment
Figure 4 for LLMCBench: Benchmarking Large Language Model Compression for Efficient Deployment
Viaarxiv icon

HarmoniCa: Harmonizing Training and Inference for Better Feature Cache in Diffusion Transformer Acceleration

Add code
Oct 02, 2024
Viaarxiv icon

A Survey of Low-bit Large Language Models: Basics, Systems, and Algorithms

Add code
Sep 25, 2024
Figure 1 for A Survey of Low-bit Large Language Models: Basics, Systems, and Algorithms
Figure 2 for A Survey of Low-bit Large Language Models: Basics, Systems, and Algorithms
Figure 3 for A Survey of Low-bit Large Language Models: Basics, Systems, and Algorithms
Figure 4 for A Survey of Low-bit Large Language Models: Basics, Systems, and Algorithms
Viaarxiv icon

DDK: Distilling Domain Knowledge for Efficient Large Language Models

Add code
Jul 23, 2024
Figure 1 for DDK: Distilling Domain Knowledge for Efficient Large Language Models
Figure 2 for DDK: Distilling Domain Knowledge for Efficient Large Language Models
Figure 3 for DDK: Distilling Domain Knowledge for Efficient Large Language Models
Figure 4 for DDK: Distilling Domain Knowledge for Efficient Large Language Models
Viaarxiv icon