Picture for Sanjit Neelam

Sanjit Neelam

SPIRe: Boosting LLM Inference Throughput with Speculative Decoding

Add code
Apr 08, 2025
Viaarxiv icon