Kuangyu Ding

Optimization Hyper-parameter Laws for Large Language Models

Sep 07, 2024

Developing Lagrangian-based Methods for Nonsmooth Nonconvex Optimization

Apr 15, 2024

Adam-family Methods with Decoupled Weight Decay in Deep Learning

Oct 13, 2023

Nonconvex Stochastic Bregman Proximal Gradient Method with Application to Deep Learning

Jun 29, 2023