Picture for Dayal Singh Kalra

Dayal Singh Kalra

When Can You Get Away with Low Memory Adam?

Add code
Mar 03, 2025
Viaarxiv icon

Why Warmup the Learning Rate? Underlying Mechanisms and Improvements

Add code
Jun 13, 2024
Viaarxiv icon

Universal Sharpness Dynamics in Neural Network Training: Fixed Point Analysis, Edge of Stability, and Route to Chaos

Add code
Nov 03, 2023
Viaarxiv icon

Phase diagram of training dynamics in deep neural networks: effect of learning rate, depth, and width

Add code
Feb 23, 2023
Viaarxiv icon