Peri-LN: Revisiting Layer Normalization in the Transformer Architecture

Add code
Feb 04, 2025
Figure 1 for Peri-LN: Revisiting Layer Normalization in the Transformer Architecture
Figure 2 for Peri-LN: Revisiting Layer Normalization in the Transformer Architecture
Figure 3 for Peri-LN: Revisiting Layer Normalization in the Transformer Architecture
Figure 4 for Peri-LN: Revisiting Layer Normalization in the Transformer Architecture

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: