Jia wei

SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference Acceleration

Oct 03, 2024
Via arXiv