Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Information FOMO: The unhealthy fear of missing out on information. A method for removing misleading data for healthier models

Aug 27, 2022

Ethan Pickering, Themistoklis P. Sapsis

Figure 1 for Information FOMO: The unhealthy fear of missing out on information. A method for removing misleading data for healthier models

Figure 2 for Information FOMO: The unhealthy fear of missing out on information. A method for removing misleading data for healthier models

Figure 3 for Information FOMO: The unhealthy fear of missing out on information. A method for removing misleading data for healthier models

Figure 4 for Information FOMO: The unhealthy fear of missing out on information. A method for removing misleading data for healthier models

Share this with someone who'll enjoy it:

Abstract:Not all data are equal. Misleading or unnecessary data can critically hinder the accuracy of Machine Learning (ML) models. When data is plentiful, misleading effects can be overcome, but in many real-world applications data is sparse and expensive to acquire. We present a method that substantially reduces the data size necessary to accurately train ML models, potentially opening the door for many new, limited-data applications in ML. Our method extracts the most informative data, while ignoring and omitting data that misleads the ML model to inferior generalization properties. Specifically, the method eliminates the phenomena of "double descent", where more data leads to worse performance. This approach brings several key features to the ML community. Notably, the method naturally converges and removes the traditional need to divide the dataset into training, testing, and validation data. Instead, the selection metric inherently assesses testing error. This ensures that key information is never wasted in testing or validation.

* 8 pages, 6 figures

View paper on

Share this with someone who'll enjoy it:

Title:Information FOMO: The unhealthy fear of missing out on information. A method for removing misleading data for healthier models

Paper and Code