Nikolay Sakharnykh

A Data-Centric Approach for Training Deep Neural Networks with Less Data

Oct 29, 2021

Mohammad Motamedi, Nikolay Sakharnykh, Tim Kaldewey

Abstract: While the availability of large datasets is perceived to be a key requirement for training deep neural networks, it is possible to train such models with relatively little data. However, compensating for the absence of large datasets demands a series of actions to enhance the quality of the existing samples and to generate new ones. This paper summarizes our winning submission to the "Data-Centric AI" competition. We discuss some of the challenges that arise while training with a small dataset, offer a principled approach for systematic data quality enhancement, and propose a GAN-based solution for synthesizing new data points. Our evaluations indicate that the dataset generated by the proposed pipeline offers a 5% accuracy improvement while being significantly smaller than the baseline.

* 5 pages, 2 figures
