Picture for Lizhang Chen

Lizhang Chen

Improving Adaptive Moment Optimization via Preconditioner Diagonalization

Add code
Feb 11, 2025
Viaarxiv icon

Cautious Optimizers: Improving Training with One Line of Code

Add code
Nov 25, 2024
Figure 1 for Cautious Optimizers: Improving Training with One Line of Code
Figure 2 for Cautious Optimizers: Improving Training with One Line of Code
Figure 3 for Cautious Optimizers: Improving Training with One Line of Code
Figure 4 for Cautious Optimizers: Improving Training with One Line of Code
Viaarxiv icon

Memory-Efficient LLM Training with Online Subspace Descent

Add code
Aug 23, 2024
Viaarxiv icon

H-Fac: Memory-Efficient Optimization with Factorized Hamiltonian Descent

Add code
Jun 17, 2024
Figure 1 for H-Fac: Memory-Efficient Optimization with Factorized Hamiltonian Descent
Figure 2 for H-Fac: Memory-Efficient Optimization with Factorized Hamiltonian Descent
Figure 3 for H-Fac: Memory-Efficient Optimization with Factorized Hamiltonian Descent
Figure 4 for H-Fac: Memory-Efficient Optimization with Factorized Hamiltonian Descent
Viaarxiv icon

Communication Efficient Distributed Training with Distributed Lion

Add code
Mar 30, 2024
Figure 1 for Communication Efficient Distributed Training with Distributed Lion
Figure 2 for Communication Efficient Distributed Training with Distributed Lion
Figure 3 for Communication Efficient Distributed Training with Distributed Lion
Figure 4 for Communication Efficient Distributed Training with Distributed Lion
Viaarxiv icon

Lion Secretly Solves Constrained Optimization: As Lyapunov Predicts

Add code
Oct 12, 2023
Figure 1 for Lion Secretly Solves Constrained Optimization: As Lyapunov Predicts
Figure 2 for Lion Secretly Solves Constrained Optimization: As Lyapunov Predicts
Figure 3 for Lion Secretly Solves Constrained Optimization: As Lyapunov Predicts
Figure 4 for Lion Secretly Solves Constrained Optimization: As Lyapunov Predicts
Viaarxiv icon

An Experimental Study of Semantic Continuity for Deep Learning Models

Add code
Nov 19, 2020
Figure 1 for An Experimental Study of Semantic Continuity for Deep Learning Models
Figure 2 for An Experimental Study of Semantic Continuity for Deep Learning Models
Figure 3 for An Experimental Study of Semantic Continuity for Deep Learning Models
Figure 4 for An Experimental Study of Semantic Continuity for Deep Learning Models
Viaarxiv icon