Picture for Wenrui Huang

Wenrui Huang

Lil: Less is Less When Applying Post-Training Sparse-Attention Algorithms in Long-Decode Stage

Add code
Jan 06, 2026
Viaarxiv icon

EPIC: Efficient Position-Independent Context Caching for Serving Large Language Models

Add code
Oct 20, 2024
Viaarxiv icon