Picture for Daniel Heinlein

Daniel Heinlein

SPIRe: Boosting LLM Inference Throughput with Speculative Decoding

Add code
Apr 08, 2025
Viaarxiv icon