Luke Hudlass-Galley

Approximate Top-$k$ for Increased Parallelism

Dec 05, 2024
Via arXiv

SparQ Attention: Bandwidth-Efficient LLM Inference

Dec 08, 2023
Via arXiv