Picture for Sangjin Choi

Sangjin Choi

Lossless Acceleration of Large Language Models with Hierarchical Drafting based on Temporal Locality in Speculative Decoding

Add code
Feb 08, 2025
Viaarxiv icon