Valeria Ruscio

Beyond position: how rotary embeddings shape representations and memory in autoregressive transfomers

Oct 23, 2024

Attention-likelihood relationship in transformers

Mar 15, 2023