Picture for Jialong Guo

Jialong Guo

SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-parameterized Batch Normalization

Add code
May 19, 2024
Viaarxiv icon