Tinashu Zhu

S2D: Sorted Speculative Decoding For More Efficient Deployment of Nested Large Language Models

Jul 02, 2024
Via arXiv