Title: Measuring Forgetting of Memorized Training Examples

Jun 30, 2022

Matthew Jagielski, Om Thakkar, Florian Tramèr, Daphne Ippolito, Katherine Lee, Nicholas Carlini, Eric Wallace, Shuang Song, Abhradeep Thakurta, Nicolas Papernot (+1 more)

Abstract: Machine learning models exhibit two seemingly contradictory phenomena: training data memorization and various forms of forgetting. In memorization, models overfit specific training examples and become susceptible to privacy attacks. In forgetting, examples that appeared early in training are forgotten by the end. In this work, we connect these phenomena. We propose a technique to measure to what extent models "forget" the specifics of training examples, becoming less susceptible to privacy attacks on examples they have not seen recently. We show that, while non-convexity can prevent forgetting from happening in the worst case, standard image and speech models empirically do forget examples over time. We identify nondeterminism as a potential explanation, showing that deterministically trained models do not forget. Our results suggest that examples seen early when training with extremely large datasets (for instance, those examples used to pre-train a model) may observe privacy benefits at the expense of examples seen later.
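The measurement idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes that some membership-inference attack has already produced a score for each example at several checkpoints, and it summarizes attack success as an AUC. The function names, the choice of AUC, and the scores are all hypothetical. If the model forgets, the AUC should drift from near 1.0 toward 0.5 (chance) as more training steps pass since the examples were last seen.

```python
from typing import Dict, Sequence, Tuple


def attack_auc(member_scores: Sequence[float],
               nonmember_scores: Sequence[float]) -> float:
    """Probability that a random member outscores a random non-member.

    Ties count as half a win. 1.0 means the attack separates the two
    groups perfectly; 0.5 means it does no better than chance.
    """
    wins = 0.0
    for m in member_scores:
        for n in nonmember_scores:
            if m > n:
                wins += 1.0
            elif m == n:
                wins += 0.5
    return wins / (len(member_scores) * len(nonmember_scores))


def forgetting_curve(
    scores_by_step: Dict[int, Tuple[Sequence[float], Sequence[float]]]
) -> Dict[int, float]:
    """Map 'training steps since the examples were last seen' to attack AUC."""
    return {
        step: attack_auc(members, nonmembers)
        for step, (members, nonmembers) in sorted(scores_by_step.items())
    }


# Hypothetical attack scores (higher = "looks more like a training member").
# Each entry: steps since last seen -> (member scores, non-member scores).
example_scores = {
    0: ([0.9, 0.8, 0.95], [0.2, 0.3, 0.1]),
    1000: ([0.5, 0.3, 0.6], [0.4, 0.3, 0.2]),
    5000: ([0.3, 0.2, 0.4], [0.4, 0.3, 0.2]),
}

curve = forgetting_curve(example_scores)
for step, auc in curve.items():
    print(f"steps since last seen: {step:5d}  attack AUC: {auc:.3f}")
```

With these made-up scores the attack AUC falls as the gap grows, which is the pattern the abstract describes as forgetting. A flat curve would correspond to a model that does not forget.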

* 19 pages, 7 figures

View paper on OpenReview
