Picture for Robert Gower

Robert Gower

Non-Euclidean Gradient Descent Operates at the Edge of Stability

Add code
Mar 05, 2026
Viaarxiv icon

In Search of Adam's Secret Sauce

Add code
May 27, 2025
Viaarxiv icon

The Polar Express: Optimal Matrix Sign Methods and Their Application to the Muon Algorithm

Add code
May 22, 2025
Figure 1 for The Polar Express: Optimal Matrix Sign Methods and Their Application to the Muon Algorithm
Figure 2 for The Polar Express: Optimal Matrix Sign Methods and Their Application to the Muon Algorithm
Figure 3 for The Polar Express: Optimal Matrix Sign Methods and Their Application to the Muon Algorithm
Figure 4 for The Polar Express: Optimal Matrix Sign Methods and Their Application to the Muon Algorithm
Viaarxiv icon

SGD with Clipping is Secretly Estimating the Median Gradient

Add code
Feb 20, 2024
Figure 1 for SGD with Clipping is Secretly Estimating the Median Gradient
Figure 2 for SGD with Clipping is Secretly Estimating the Median Gradient
Figure 3 for SGD with Clipping is Secretly Estimating the Median Gradient
Figure 4 for SGD with Clipping is Secretly Estimating the Median Gradient
Viaarxiv icon

SANIA: Polyak-type Optimization Framework Leads to Scale Invariant Stochastic Algorithms

Add code
Dec 28, 2023
Figure 1 for SANIA: Polyak-type Optimization Framework Leads to Scale Invariant Stochastic Algorithms
Figure 2 for SANIA: Polyak-type Optimization Framework Leads to Scale Invariant Stochastic Algorithms
Figure 3 for SANIA: Polyak-type Optimization Framework Leads to Scale Invariant Stochastic Algorithms
Figure 4 for SANIA: Polyak-type Optimization Framework Leads to Scale Invariant Stochastic Algorithms
Viaarxiv icon

Variational Inference with Gaussian Score Matching

Add code
Jul 15, 2023
Viaarxiv icon

Provable convergence guarantees for black-box variational inference

Add code
Jun 04, 2023
Figure 1 for Provable convergence guarantees for black-box variational inference
Viaarxiv icon