Picture for Yupeng Su

Yupeng Su

LLM-Barber: Block-Aware Rebuilder for Sparsity Mask in One-Shot for Large Language Models

Add code
Aug 20, 2024
Viaarxiv icon

APTQ: Attention-aware Post-Training Mixed-Precision Quantization for Large Language Models

Add code
Feb 21, 2024
Viaarxiv icon