Picture for Weilun Zhao

Weilun Zhao

FR-Spec: Accelerating Large-Vocabulary Language Models via Frequency-Ranked Speculative Sampling

Add code
Feb 20, 2025
Viaarxiv icon