Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Nestor Demeure

Ranger21: a synergistic deep learning optimizer

Jun 25, 2021

Less Wright, Nestor Demeure

Figure 1 for Ranger21: a synergistic deep learning optimizer

Figure 2 for Ranger21: a synergistic deep learning optimizer

Figure 3 for Ranger21: a synergistic deep learning optimizer

Figure 4 for Ranger21: a synergistic deep learning optimizer

Abstract:As optimizers are critical to the performances of neural networks, every year a large number of papers innovating on the subject are published. However, while most of these publications provide incremental improvements to existing algorithms, they tend to be presented as new optimizers rather than composable algorithms. Thus, many worthwhile improvements are rarely seen out of their initial publication. Taking advantage of this untapped potential, we introduce Ranger21, a new optimizer which combines AdamW with eight components, carefully selected after reviewing and testing ideas from the literature. We found that the resulting optimizer provides significantly improved validation accuracy and training speed, smoother training curves, and is even able to train a ResNet50 on ImageNet2012 without Batch Normalization layers. A problem on which AdamW stays systematically stuck in a bad initial state.

* for associated code, see https://github.com/lessw2020/Ranger21

Via

Access Paper or Ask Questions