Ankur Agrawal

Accumulation Bit-Width Scaling For Ultra-Low Precision Training Of Deep Networks

Jan 19, 2019
Via arXiv

AdaComp : Adaptive Residual Gradient Compression for Data-Parallel Distributed Training

Dec 07, 2017
Via arXiv

Deep Learning with Limited Numerical Precision

Feb 09, 2015
Via arXiv