Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Data optimization for large batch distributed training of deep neural networks

Dec 18, 2020

Shubhankar Gahlot, Junqi Yin, Mallikarjun Shankar

Figure 1 for Data optimization for large batch distributed training of deep neural networks

Figure 2 for Data optimization for large batch distributed training of deep neural networks

Figure 3 for Data optimization for large batch distributed training of deep neural networks

Figure 4 for Data optimization for large batch distributed training of deep neural networks

Share this with someone who'll enjoy it:

Abstract:Distributed training in deep learning (DL) is common practice as data and models grow. The current practice for distributed training of deep neural networks faces the challenges of communication bottlenecks when operating at scale, and model accuracy deterioration with an increase in global batch size. Present solutions focus on improving message exchange efficiency as well as implementing techniques to tweak batch sizes and models in the training process. The loss of training accuracy typically happens because the loss function gets trapped in a local minima. We observe that the loss landscape minimization is shaped by both the model and training data and propose a data optimization approach that utilizes machine learning to implicitly smooth out the loss landscape resulting in fewer local minima. Our approach filters out data points which are less important to feature learning, enabling us to speed up the training of models on larger batch sizes to improved accuracy.

* Computational Science & Computational Intelligence (CSCI'20), 7 pages

View paper on

Share this with someone who'll enjoy it:

Title:Data optimization for large batch distributed training of deep neural networks

Paper and Code