Picture for Mingda Wan

Mingda Wan

Theoretical Constraints on the Expressive Power of $\mathsf{RoPE}$-based Tensor Attention Transformers

Add code
Dec 23, 2024
Viaarxiv icon