Picture for Lei Guan

Lei Guan

PipeOptim: Ensuring Effective 1F1B Schedule with Optimizer-Dependent Weight Prediction

Add code
Dec 05, 2023
Viaarxiv icon

AdaPlus: Integrating Nesterov Momentum and Precise Stepsize Adjustment on AdamW Basis

Add code
Sep 05, 2023
Viaarxiv icon

XGrad: Boosting Gradient-Based Optimizers With Weight Prediction

Add code
May 26, 2023
Viaarxiv icon

Weight Prediction Boosts the Convergence of AdamW

Add code
Feb 01, 2023
Viaarxiv icon

XPipe: Efficient Pipeline Model Parallelism for Multi-GPU DNN Training

Add code
Nov 20, 2019
Figure 1 for XPipe: Efficient Pipeline Model Parallelism for Multi-GPU DNN Training
Figure 2 for XPipe: Efficient Pipeline Model Parallelism for Multi-GPU DNN Training
Figure 3 for XPipe: Efficient Pipeline Model Parallelism for Multi-GPU DNN Training
Figure 4 for XPipe: Efficient Pipeline Model Parallelism for Multi-GPU DNN Training
Viaarxiv icon

Non-ergodic Convergence Analysis of Heavy-Ball Algorithms

Add code
Nov 09, 2018
Figure 1 for Non-ergodic Convergence Analysis of Heavy-Ball Algorithms
Viaarxiv icon

An Efficient ADMM-Based Algorithm to Nonconvex Penalized Support Vector Machines

Add code
Sep 11, 2018
Figure 1 for An Efficient ADMM-Based Algorithm to Nonconvex Penalized Support Vector Machines
Figure 2 for An Efficient ADMM-Based Algorithm to Nonconvex Penalized Support Vector Machines
Figure 3 for An Efficient ADMM-Based Algorithm to Nonconvex Penalized Support Vector Machines
Figure 4 for An Efficient ADMM-Based Algorithm to Nonconvex Penalized Support Vector Machines
Viaarxiv icon