Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Daniel Shwartz

United We Stand: Using Epoch-wise Agreement of Ensembles to Combat Overfit

Oct 17, 2023

Uri Stern, Daniel Shwartz, Daphna Weinshall

Figure 1 for United We Stand: Using Epoch-wise Agreement of Ensembles to Combat Overfit

Figure 2 for United We Stand: Using Epoch-wise Agreement of Ensembles to Combat Overfit

Figure 3 for United We Stand: Using Epoch-wise Agreement of Ensembles to Combat Overfit

Figure 4 for United We Stand: Using Epoch-wise Agreement of Ensembles to Combat Overfit

Abstract:Deep neural networks have become the method of choice for solving many image classification tasks, largely because they can fit very complex functions defined over raw images. The downside of such powerful learners is the danger of overfitting the training set, leading to poor generalization, which is usually avoided by regularization and "early stopping" of the training. In this paper, we propose a new deep network ensemble classifier that is very effective against overfit. We begin with the theoretical analysis of a regression model, whose predictions - that the variance among classifiers increases when overfit occurs - is demonstrated empirically in deep networks in common use. Guided by these results, we construct a new ensemble-based prediction method designed to combat overfit, where the prediction is determined by the most consensual prediction throughout the training. On multiple image and text classification datasets, we show that when regular ensembles suffer from overfit, our method eliminates the harmful reduction in generalization due to overfit, and often even surpasses the performance obtained by early stopping. Our method is easy to implement, and can be integrated with any training scheme and architecture, without additional prior knowledge beyond the training set. Accordingly, it is a practical and useful tool to overcome overfit.

Via

Access Paper or Ask Questions

The Dynamic of Consensus in Deep Networks and the Identification of Noisy Labels

Oct 02, 2022

Daniel Shwartz, Uri Stern, Daphna Weinshall

Figure 1 for The Dynamic of Consensus in Deep Networks and the Identification of Noisy Labels

Figure 2 for The Dynamic of Consensus in Deep Networks and the Identification of Noisy Labels

Figure 3 for The Dynamic of Consensus in Deep Networks and the Identification of Noisy Labels

Figure 4 for The Dynamic of Consensus in Deep Networks and the Identification of Noisy Labels

Abstract:Deep neural networks have incredible capacity and expressibility, and can seemingly memorize any training set. This introduces a problem when training in the presence of noisy labels, as the noisy examples cannot be distinguished from clean examples by the end of training. Recent research has dealt with this challenge by utilizing the fact that deep networks seem to memorize clean examples much earlier than noisy examples. Here we report a new empirical result: for each example, when looking at the time it has been memorized by each model in an ensemble of networks, the diversity seen in noisy examples is much larger than the clean examples. We use this observation to develop a new method for noisy labels filtration. The method is based on a statistics of the data, which captures the differences in ensemble learning dynamics between clean and noisy data. We test our method on three tasks: (i) noise amount estimation; (ii) noise filtration; (iii) supervised classification. We show that our method improves over existing baselines in all three tasks using a variety of datasets, noise models, and noise levels. Aside from its improved performance, our method has two other advantages. (i) Simplicity, which implies that no additional hyperparameters are introduced. (ii) Our method is modular: it does not work in an end-to-end fashion, and can therefore be used to clean a dataset for any other future usage.

Via

Access Paper or Ask Questions