Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Inadequately Pre-trained Models are Better Feature Extractors

Mar 09, 2022

Andong Deng, Xingjian Li, Zhibing Li, Di Hu, Chengzhong Xu, Dejing Dou

Figure 1 for Inadequately Pre-trained Models are Better Feature Extractors

Figure 2 for Inadequately Pre-trained Models are Better Feature Extractors

Figure 3 for Inadequately Pre-trained Models are Better Feature Extractors

Figure 4 for Inadequately Pre-trained Models are Better Feature Extractors

Share this with someone who'll enjoy it:

Abstract:Pre-training has been a popular learning paradigm in deep learning era, especially in annotation-insufficient scenario. Better ImageNet pre-trained models have been demonstrated, from the perspective of architecture, by previous research to have better transferability to downstream tasks. However, in this paper, we found that during the same pre-training process, models at middle epochs, which is inadequately pre-trained, can outperform fully trained models when used as feature extractors (FE), while the fine-tuning (FT) performance still grows with the source performance. This reveals that there is not a solid positive correlation between top-1 accuracy on ImageNet and the transferring result on target data. Based on the contradictory phenomenon between FE and FT that better feature extractor fails to be fine-tuned better accordingly, we conduct comprehensive analyses on features before softmax layer to provide insightful explanations. Our discoveries suggest that, during pre-training, models tend to first learn spectral components corresponding to large singular values and the residual components contribute more when fine-tuning.

View paper on

Share this with someone who'll enjoy it:

Title:Inadequately Pre-trained Models are Better Feature Extractors

Paper and Code