SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning

Jan 04, 2021


