Siran Liu

SampleAttention: Near-Lossless Acceleration of Long Context LLM Inference with Adaptive Structured Sparse Attention

Jun 28, 2024
Via arXiv

Can independent Metropolis beat crude Monte Carlo?

Jun 25, 2024
Via arXiv

Near-Lossless Acceleration of Long Context LLM Inference with Adaptive Structured Sparse Attention

Jun 17, 2024
Via arXiv