Picture for Yupeng Su

Yupeng Su

Quantization Meets Reasoning: Exploring LLM Low-Bit Quantization Degradation for Mathematical Reasoning

Add code
Jan 06, 2025
Viaarxiv icon

LLM-Barber: Block-Aware Rebuilder for Sparsity Mask in One-Shot for Large Language Models

Add code
Aug 20, 2024
Viaarxiv icon

APTQ: Attention-aware Post-Training Mixed-Precision Quantization for Large Language Models

Add code
Feb 21, 2024
Viaarxiv icon