Monte Lunacek

An Empirical Deep Dive into Deep Learning's Driving Dynamics

Jul 25, 2022

Charles Edison Tripp, Jordan Perr-Sauer, Lucas Hayne, Monte Lunacek

Abstract: We present an empirical dataset surveying the deep learning phenomenon on fully-connected networks, encompassing the training and test performance of numerous network topologies, sweeping across multiple learning tasks, depths, numbers of free parameters, learning rates, batch sizes, and regularization penalties. The dataset probes 178 thousand hyperparameter settings with an average of 20 repetitions each, totaling 3.5 million training runs and 20 performance metrics for each of the 13.1 billion training epochs observed. Accumulating this 671 GB dataset required 5,448 CPU core-years, 17.8 GPU-years, and 111.2 node-years. Additionally, we provide a preliminary analysis revealing patterns that persist across learning tasks and topologies. We aim to inspire work that empirically studies modern machine learning techniques as a catalyst for the theoretical discoveries needed to progress the field beyond energy-intensive and heuristic practices.
