SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference Acceleration

Add code
Oct 03, 2024
Figure 1 for SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference Acceleration
Figure 2 for SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference Acceleration
Figure 3 for SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference Acceleration
Figure 4 for SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference Acceleration

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: