Picture for Valeria Ruscio

Valeria Ruscio

Beyond position: how rotary embeddings shape representations and memory in autoregressive transfomers

Add code
Oct 23, 2024
Viaarxiv icon

Attention-likelihood relationship in transformers

Add code
Mar 15, 2023
Viaarxiv icon