Title: Gradient Projection Memory for Continual Learning

Mar 17, 2021

Gobinda Saha, Isha Garg, Kaushik Roy

Abstract: The ability to learn continually without forgetting past tasks is a desired attribute for artificial learning systems. Existing approaches to enable such learning in artificial neural networks usually rely on network growth, importance-based weight updates, or replay of old data from memory. In contrast, we propose a novel approach where a neural network learns new tasks by taking gradient steps in the direction orthogonal to the gradient subspaces deemed important for the past tasks. We find the bases of these subspaces by analyzing network representations (activations) after learning each task with Singular Value Decomposition (SVD) in a single-shot manner and store them in memory as Gradient Projection Memory (GPM). With qualitative and quantitative analyses, we show that such orthogonal gradient descent induces minimal to no interference with the past tasks, thereby mitigating forgetting. We evaluate our algorithm on diverse image classification datasets with short and long sequences of tasks and report better or on-par performance compared to the state-of-the-art approaches.
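The core idea in the abstract can be illustrated with a minimal NumPy sketch for a single linear layer. This is not the authors' implementation. The function names (`select_basis`, `project_gradient`), the `energy_threshold` parameter, and the toy data are assumptions made for illustration. The sketch takes the SVD of a matrix of stored activations, keeps the leading left singular vectors as the "memory", and removes from a new gradient the component lying in that subspace.

```python
import numpy as np

def select_basis(activations, energy_threshold=0.999999):
    """Return an orthonormal basis (n_features, k) for the dominant subspace.

    activations: array of shape (n_features, n_samples), one column per input.
    energy_threshold: hypothetical knob; fraction of squared singular-value
    mass that the kept directions must capture.
    """
    U, S, _ = np.linalg.svd(activations, full_matrices=False)
    energy = np.cumsum(S**2) / np.sum(S**2)
    # Smallest k whose cumulative energy reaches the threshold.
    k = min(int(np.searchsorted(energy, energy_threshold)) + 1, len(S))
    return U[:, :k]

def project_gradient(grad, basis):
    """Remove the part of grad (n_out, n_features) that lies in span(basis)."""
    return grad - grad @ basis @ basis.T

rng = np.random.default_rng(0)

# Toy "old task": layer inputs that live in a 3-dimensional subspace of R^10.
latent = rng.normal(size=(10, 3))
old_inputs = latent @ rng.normal(size=(3, 50))

basis = select_basis(old_inputs)        # stored after the old task
grad = rng.normal(size=(4, 10))         # stand-in gradient from a new task
grad_proj = project_gradient(grad, basis)

# A weight update along grad_proj changes the layer's outputs on the old
# inputs by (a multiple of) grad_proj @ old_inputs, which should be near zero.
interference = np.abs(grad_proj @ old_inputs).max()
unprojected = np.abs(grad @ old_inputs).max()
```

In this toy setting the projected update leaves the layer's responses to the old inputs essentially unchanged, while the unprojected gradient does not. A fuller treatment would repeat this per layer and extend the stored basis after each task.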

* International Conference on Learning Representations (ICLR), 2021
* Accepted for Oral Presentation at ICLR 2021: https://openreview.net/forum?id=3AOj0RCNC2
