Picture for Dmitry Yarotsky

Dmitry Yarotsky

A prism hierarchy of learning regimes in large linear autoencoders

Add code
Jun 03, 2026
Viaarxiv icon

Gradient Flow Through Diagram Expansions: Learning Regimes and Explicit Solutions

Add code
Feb 04, 2026
Viaarxiv icon

Corner Gradient Descent

Add code
Apr 16, 2025
Figure 1 for Corner Gradient Descent
Figure 2 for Corner Gradient Descent
Figure 3 for Corner Gradient Descent
Figure 4 for Corner Gradient Descent
Viaarxiv icon

SGD with memory: fundamental properties and stochastic acceleration

Add code
Oct 05, 2024
Figure 1 for SGD with memory: fundamental properties and stochastic acceleration
Figure 2 for SGD with memory: fundamental properties and stochastic acceleration
Figure 3 for SGD with memory: fundamental properties and stochastic acceleration
Figure 4 for SGD with memory: fundamental properties and stochastic acceleration
Viaarxiv icon

Generalization error of spectral algorithms

Add code
Mar 18, 2024
Figure 1 for Generalization error of spectral algorithms
Figure 2 for Generalization error of spectral algorithms
Figure 3 for Generalization error of spectral algorithms
Figure 4 for Generalization error of spectral algorithms
Viaarxiv icon

Learning high-dimensional targets by two-parameter models and gradient flow

Add code
Feb 26, 2024
Figure 1 for Learning high-dimensional targets by two-parameter models and gradient flow
Figure 2 for Learning high-dimensional targets by two-parameter models and gradient flow
Figure 3 for Learning high-dimensional targets by two-parameter models and gradient flow
Viaarxiv icon

Structure of universal formulas

Add code
Nov 07, 2023
Viaarxiv icon

A view of mini-batch SGD via generating functions: conditions of convergence, phase transitions, benefit from negative momenta

Add code
Jun 22, 2022
Figure 1 for A view of mini-batch SGD via generating functions: conditions of convergence, phase transitions, benefit from negative momenta
Figure 2 for A view of mini-batch SGD via generating functions: conditions of convergence, phase transitions, benefit from negative momenta
Figure 3 for A view of mini-batch SGD via generating functions: conditions of convergence, phase transitions, benefit from negative momenta
Figure 4 for A view of mini-batch SGD via generating functions: conditions of convergence, phase transitions, benefit from negative momenta
Viaarxiv icon

Embedded Ensembles: Infinite Width Limit and Operating Regimes

Add code
Feb 24, 2022
Figure 1 for Embedded Ensembles: Infinite Width Limit and Operating Regimes
Figure 2 for Embedded Ensembles: Infinite Width Limit and Operating Regimes
Figure 3 for Embedded Ensembles: Infinite Width Limit and Operating Regimes
Figure 4 for Embedded Ensembles: Infinite Width Limit and Operating Regimes
Viaarxiv icon

Tight Convergence Rate Bounds for Optimization Under Power Law Spectral Conditions

Add code
Feb 02, 2022
Figure 1 for Tight Convergence Rate Bounds for Optimization Under Power Law Spectral Conditions
Figure 2 for Tight Convergence Rate Bounds for Optimization Under Power Law Spectral Conditions
Figure 3 for Tight Convergence Rate Bounds for Optimization Under Power Law Spectral Conditions
Figure 4 for Tight Convergence Rate Bounds for Optimization Under Power Law Spectral Conditions
Viaarxiv icon