Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Disentangled Continual Learning: Separating Memory Edits from Model Updates

Dec 27, 2023

Sebastian Dziadzio, Çağatay Yıldız, Gido M. van de Ven, Tomasz Trzciński, Tinne Tuytelaars, Matthias Bethge

Figure 1 for Disentangled Continual Learning: Separating Memory Edits from Model Updates

Figure 2 for Disentangled Continual Learning: Separating Memory Edits from Model Updates

Figure 3 for Disentangled Continual Learning: Separating Memory Edits from Model Updates

Figure 4 for Disentangled Continual Learning: Separating Memory Edits from Model Updates

Share this with someone who'll enjoy it:

Abstract:The ability of machine learning systems to learn continually is hindered by catastrophic forgetting, the tendency of neural networks to overwrite existing knowledge when learning a new task. Existing continual learning methods alleviate this problem through regularisation, parameter isolation, or rehearsal, and are typically evaluated on benchmarks consisting of a handful of tasks. We propose a novel conceptual approach to continual classification that aims to disentangle class-specific information that needs to be memorised from the class-agnostic knowledge that encapsulates generalization. We store the former in a buffer that can be easily pruned or updated when new categories arrive, while the latter is represented with a neural network that generalizes across tasks. We show that the class-agnostic network does not suffer from catastrophic forgetting and by leveraging it to perform classification, we improve accuracy on past tasks over time. In addition, our approach supports open-set classification and one-shot generalization. To test our conceptual framework, we introduce Infinite dSprites, a tool for creating continual classification and disentanglement benchmarks of arbitrary length with full control over generative factors. We show that over a sufficiently long time horizon all major types of continual learning methods break down, while our approach enables continual learning over hundreds of tasks with explicit control over memorization and forgetting.

* 12 pages, 11 figures

View paper on

Share this with someone who'll enjoy it:

Title:Disentangled Continual Learning: Separating Memory Edits from Model Updates

Paper and Code