Picture for Nolan Miller

Nolan Miller

Narrowing the Focus: Learned Optimizers for Pretrained Models

Add code
Aug 21, 2024
Viaarxiv icon

Uncovering mesa-optimization algorithms in Transformers

Add code
Sep 11, 2023
Figure 1 for Uncovering mesa-optimization algorithms in Transformers
Figure 2 for Uncovering mesa-optimization algorithms in Transformers
Figure 3 for Uncovering mesa-optimization algorithms in Transformers
Figure 4 for Uncovering mesa-optimization algorithms in Transformers
Viaarxiv icon

Training trajectories, mini-batch losses and the curious role of the learning rate

Add code
Jan 05, 2023
Viaarxiv icon

Decentralized Learning with Multi-Headed Distillation

Add code
Nov 28, 2022
Viaarxiv icon

Meta-Learning Bidirectional Update Rules

Add code
Apr 10, 2021
Figure 1 for Meta-Learning Bidirectional Update Rules
Figure 2 for Meta-Learning Bidirectional Update Rules
Figure 3 for Meta-Learning Bidirectional Update Rules
Figure 4 for Meta-Learning Bidirectional Update Rules
Viaarxiv icon