Title: Alleviating Representational Shift for Continual Fine-tuning

Apr 22, 2022

Shibo Jie, Zhi-Hong Deng, Ziheng Li

Abstract: We study a practical setting of continual learning: continually fine-tuning a pre-trained model. Previous work has found that, when training on new tasks, the features (penultimate layer representations) of previous data change, a phenomenon called representational shift. Beyond this shift of features, we reveal that the intermediate layers' representational shift (IRS) also matters, since it disrupts batch normalization, which is another crucial cause of catastrophic forgetting. Motivated by this, we propose ConFiT, a fine-tuning method incorporating two components: cross-convolution batch normalization (Xconv BN) and hierarchical fine-tuning. Xconv BN maintains pre-convolution running means instead of post-convolution ones and recovers the post-convolution means before testing, which corrects the inaccurate estimates of means under IRS. Hierarchical fine-tuning uses a multi-stage strategy to fine-tune the pre-trained network, preventing massive changes in Conv layers and thus alleviating IRS. Experimental results on four datasets show that our method remarkably outperforms several state-of-the-art methods with lower storage overhead.
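
The idea behind Xconv BN can be illustrated with a small numerical sketch. It is not the authors' implementation. It assumes a 1x1 convolution (so the layer is linear at every pixel), assumes the layer's input distribution is unchanged, and lets only the convolution weights drift. All function and variable names are hypothetical. Under those assumptions, a post-convolution mean stored before the drift goes stale, while a stored pre-convolution mean can be pushed through the current weights to recover the correct post-convolution mean.

```python
import numpy as np

def conv1x1(x, weight, bias):
    # x: (N, C_in, H, W); weight: (C_out, C_in); bias: (C_out,)
    # A 1x1 convolution is a per-pixel linear map over channels.
    return np.einsum("oc,nchw->nohw", weight, x) + bias[None, :, None, None]

def recover_post_conv_mean(pre_mean, weight, bias):
    # By linearity, mean(conv(x)) == conv(mean(x)) for a 1x1 convolution.
    return weight @ pre_mean + bias

rng = np.random.default_rng(0)
x = rng.normal(loc=1.0, size=(8, 3, 5, 5))   # inputs of an earlier task
w_old = rng.normal(size=(4, 3))              # conv weights when the task was learned
b = rng.normal(size=4)

# Statistics that could be stored while training on the earlier task.
pre_mean = x.mean(axis=(0, 2, 3))                            # pre-convolution mean
post_mean_old = conv1x1(x, w_old, b).mean(axis=(0, 2, 3))    # post-convolution mean

# Assumption: later tasks change the conv weights (the source of the shift here).
w_new = w_old + 0.5 * rng.normal(size=(4, 3))

# What the BN layer should see at test time on the earlier task's data.
true_post_mean_new = conv1x1(x, w_new, b).mean(axis=(0, 2, 3))

# Recover the post-convolution mean from the stored pre-convolution mean.
recovered = recover_post_conv_mean(pre_mean, w_new, b)

print("stale error:    ", np.abs(post_mean_old - true_post_mean_new).max())
print("recovered error:", np.abs(recovered - true_post_mean_new).max())
```

For convolutions with larger kernels and padding, border effects mean the identity above holds only approximately, so this sketch should be read as motivation for the design rather than as a description of the method in the paper.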
