Picture for Yueming Chen

Yueming Chen

Lossless Acceleration of Large Language Model via Adaptive N-gram Parallel Decoding

Add code
Apr 10, 2024
Viaarxiv icon