Haiduo Huang

Jakiro: Boosting Speculative Decoding with Decoupled Multi-Head via MoE

Feb 10, 2025

Nearly Lossless Adaptive Bit Switching

Feb 03, 2025

Partial Channel Network: Compute Fewer, Perform Better

Feb 03, 2025

FTP: A Fine-grained Token-wise Pruner for Large Language Models via Token Routing

Dec 16, 2024