Picture for Lizhang Chen

Lizhang Chen

Cautious Optimizers: Improving Training with One Line of Code

Add code
Nov 25, 2024
Figure 1 for Cautious Optimizers: Improving Training with One Line of Code
Figure 2 for Cautious Optimizers: Improving Training with One Line of Code
Figure 3 for Cautious Optimizers: Improving Training with One Line of Code
Figure 4 for Cautious Optimizers: Improving Training with One Line of Code
Viaarxiv icon

Memory-Efficient LLM Training with Online Subspace Descent

Add code
Aug 23, 2024
Viaarxiv icon

H-Fac: Memory-Efficient Optimization with Factorized Hamiltonian Descent

Add code
Jun 17, 2024
Viaarxiv icon

Communication Efficient Distributed Training with Distributed Lion

Add code
Mar 30, 2024
Viaarxiv icon

Lion Secretly Solves Constrained Optimization: As Lyapunov Predicts

Add code
Oct 12, 2023
Viaarxiv icon

An Experimental Study of Semantic Continuity for Deep Learning Models

Add code
Nov 19, 2020
Figure 1 for An Experimental Study of Semantic Continuity for Deep Learning Models
Figure 2 for An Experimental Study of Semantic Continuity for Deep Learning Models
Figure 3 for An Experimental Study of Semantic Continuity for Deep Learning Models
Figure 4 for An Experimental Study of Semantic Continuity for Deep Learning Models
Viaarxiv icon