Steffen Dereich

Convergence rates for the Adam optimizer

Jul 29, 2024
Via arXiv

Non-convergence of Adam and other adaptive stochastic gradient descent optimization methods for non-vanishing learning rates

Jul 11, 2024
Via arXiv

Learning rate adaptive stochastic gradient descent optimization methods: numerical simulations for deep learning methods for partial differential equations and convergence analyses

Jun 20, 2024
Via arXiv

On the existence of optimal shallow feedforward networks with ReLU activation

Mar 06, 2023
Via arXiv

On the existence of minimizers in shallow residual ReLU neural network optimization landscapes

Feb 28, 2023
Via arXiv

Convergence of stochastic gradient descent schemes for Lojasiewicz-landscapes

Feb 16, 2021
Via arXiv