Umut Simsekli

Understanding the Generalization Error of Markov algorithms through Poissonization

Feb 11, 2025
Via arXiv

Algorithmic Stability of Stochastic Gradient Descent with Momentum under Heavy-Tailed Noise

Feb 02, 2025
Via arXiv

The Surprising Agreement Between Convex Optimization Theory and Learning-Rate Scheduling for Large Model Training

Jan 31, 2025
Via arXiv

Piecewise deterministic generative models

Jul 28, 2024
Via arXiv

Denoising Lévy Probabilistic Models

Jul 26, 2024
Via arXiv

Uniform Generalization Bounds on Data-Dependent Hypothesis Sets via PAC-Bayesian Theory on Random Sets

Apr 26, 2024
Via arXiv

SGD with Clipping is Secretly Estimating the Median Gradient

Feb 20, 2024
Via arXiv

A PAC-Bayesian Link Between Generalisation and Flat Minima

Feb 13, 2024
Via arXiv

Approximate Heavy Tails in Offline (Multi-Pass) Stochastic Gradient Descent

Oct 27, 2023
Via arXiv

Nonparametric Linear Feature Learning in Regression Through Regularisation

Jul 25, 2023
Via arXiv