Picture for Shen-Yi Zhao

Shen-Yi Zhao

Stochastic Normalized Gradient Descent with Momentum for Large Batch Training

Add code
Jul 28, 2020
Figure 1 for Stochastic Normalized Gradient Descent with Momentum for Large Batch Training
Figure 2 for Stochastic Normalized Gradient Descent with Momentum for Large Batch Training
Figure 3 for Stochastic Normalized Gradient Descent with Momentum for Large Batch Training
Figure 4 for Stochastic Normalized Gradient Descent with Momentum for Large Batch Training
Viaarxiv icon

Stagewise Enlargement of Batch Size for SGD-based Learning

Add code
Feb 27, 2020
Figure 1 for Stagewise Enlargement of Batch Size for SGD-based Learning
Figure 2 for Stagewise Enlargement of Batch Size for SGD-based Learning
Figure 3 for Stagewise Enlargement of Batch Size for SGD-based Learning
Figure 4 for Stagewise Enlargement of Batch Size for SGD-based Learning
Viaarxiv icon

ADASS: Adaptive Sample Selection for Training Acceleration

Add code
Jun 11, 2019
Figure 1 for ADASS: Adaptive Sample Selection for Training Acceleration
Figure 2 for ADASS: Adaptive Sample Selection for Training Acceleration
Figure 3 for ADASS: Adaptive Sample Selection for Training Acceleration
Figure 4 for ADASS: Adaptive Sample Selection for Training Acceleration
Viaarxiv icon

Clustered Reinforcement Learning

Add code
Jun 06, 2019
Figure 1 for Clustered Reinforcement Learning
Figure 2 for Clustered Reinforcement Learning
Figure 3 for Clustered Reinforcement Learning
Figure 4 for Clustered Reinforcement Learning
Viaarxiv icon

On the Convergence of Memory-Based Distributed SGD

Add code
May 30, 2019
Figure 1 for On the Convergence of Memory-Based Distributed SGD
Viaarxiv icon

Global Momentum Compression for Sparse Communication in Distributed SGD

Add code
May 30, 2019
Figure 1 for Global Momentum Compression for Sparse Communication in Distributed SGD
Figure 2 for Global Momentum Compression for Sparse Communication in Distributed SGD
Viaarxiv icon

Quantized Epoch-SGD for Communication-Efficient Distributed Learning

Add code
Jan 10, 2019
Figure 1 for Quantized Epoch-SGD for Communication-Efficient Distributed Learning
Figure 2 for Quantized Epoch-SGD for Communication-Efficient Distributed Learning
Figure 3 for Quantized Epoch-SGD for Communication-Efficient Distributed Learning
Figure 4 for Quantized Epoch-SGD for Communication-Efficient Distributed Learning
Viaarxiv icon

Proximal SCOPE for Distributed Sparse Learning: Better Data Partition Implies Faster Convergence Rate

Add code
Oct 26, 2018
Figure 1 for Proximal SCOPE for Distributed Sparse Learning: Better Data Partition Implies Faster Convergence Rate
Figure 2 for Proximal SCOPE for Distributed Sparse Learning: Better Data Partition Implies Faster Convergence Rate
Figure 3 for Proximal SCOPE for Distributed Sparse Learning: Better Data Partition Implies Faster Convergence Rate
Viaarxiv icon

Feature-Distributed SVRG for High-Dimensional Linear Classification

Add code
Feb 10, 2018
Figure 1 for Feature-Distributed SVRG for High-Dimensional Linear Classification
Figure 2 for Feature-Distributed SVRG for High-Dimensional Linear Classification
Figure 3 for Feature-Distributed SVRG for High-Dimensional Linear Classification
Figure 4 for Feature-Distributed SVRG for High-Dimensional Linear Classification
Viaarxiv icon

Lock-Free Optimization for Non-Convex Problems

Add code
Dec 11, 2016
Figure 1 for Lock-Free Optimization for Non-Convex Problems
Figure 2 for Lock-Free Optimization for Non-Convex Problems
Viaarxiv icon