Zhuofu Chen

TidalDecode: Fast and Accurate LLM Decoding with Position Persistent Sparse Attention

Oct 07, 2024
Via arXiv