Picture for Jia wei

Jia wei

SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference Acceleration

Add code
Oct 03, 2024
Figure 1 for SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference Acceleration
Figure 2 for SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference Acceleration
Figure 3 for SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference Acceleration
Figure 4 for SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference Acceleration
Viaarxiv icon