Title: The Value of Out-of-Distribution Data

Aug 23, 2022

Ashwin De Silva, Rahul Ramesh, Carey E. Priebe, Pratik Chaudhari, Joshua T. Vogelstein

Abstract: More data helps us generalize to a task. But real datasets can contain out-of-distribution (OOD) data; this can come in the form of heterogeneity such as intra-class variability but also in the form of temporal shifts or concept drifts. We demonstrate a counter-intuitive phenomenon for such problems: generalization error of the task can be a non-monotonic function of the number of OOD samples; a small number of OOD samples can improve generalization but if the number of OOD samples is beyond a threshold, then the generalization error can deteriorate. We also show that if we know which samples are OOD, then using a weighted objective between the target and OOD samples ensures that the generalization error decreases monotonically. We demonstrate and analyze this issue using linear classifiers on synthetic datasets and medium-sized neural networks on CIFAR-10.

* To be presented as a short paper at the Out-of-Distribution Generalization in Computer Vision (OOD-CV) workshop, ECCV 2022, Tel Aviv, Israel
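
The weighted objective mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact form, so this assumes a convex combination of the mean target loss and the mean OOD loss for a linear classifier with logistic loss. The weight `alpha` and the function names are hypothetical and are not taken from the paper.

```python
import numpy as np


def logistic_loss(w, X, y):
    """Mean logistic loss of a linear classifier; labels y are in {-1, +1}."""
    margins = y * (X @ w)
    # logaddexp(0, -m) equals log(1 + exp(-m)), computed in a numerically stable way
    return float(np.mean(np.logaddexp(0.0, -margins)))


def weighted_objective(w, X_target, y_target, X_ood, y_ood, alpha):
    """Assumed form: alpha * target loss + (1 - alpha) * OOD loss.

    alpha is a hypothetical mixing weight in [0, 1]; alpha = 1 ignores
    the OOD samples entirely.
    """
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    target_term = logistic_loss(w, X_target, y_target)
    # With no OOD samples, the OOD term contributes nothing
    ood_term = logistic_loss(w, X_ood, y_ood) if len(y_ood) else 0.0
    return alpha * target_term + (1.0 - alpha) * ood_term
```

Under this assumed form, each group's loss is averaged separately, so adding more OOD samples does not increase their total influence on the objective; only `alpha` does. Building the objective this way requires knowing which samples are OOD, which matches the condition stated in the abstract.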
