Siran Liu

SampleAttention: Near-Lossless Acceleration of Long Context LLM Inference with Adaptive Structured Sparse Attention

Jun 28, 2024
Via arXiv

Can independent Metropolis beat crude Monte Carlo?

Jun 25, 2024
Via arXiv

Near-Lossless Acceleration of Long Context LLM Inference with Adaptive Structured Sparse Attention

Jun 17, 2024
Via arXiv