Domas Grigaliūnas

Inference acceleration for large language models using "stairs" assisted greedy generation

Jul 29, 2024
Via arXiv