Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Meta-learning Transferable Representations with a Single Target Domain

Nov 03, 2020

Hong Liu, Jeff Z. HaoChen, Colin Wei, Tengyu Ma

Figure 1 for Meta-learning Transferable Representations with a Single Target Domain

Figure 2 for Meta-learning Transferable Representations with a Single Target Domain

Figure 3 for Meta-learning Transferable Representations with a Single Target Domain

Figure 4 for Meta-learning Transferable Representations with a Single Target Domain

Share this with someone who'll enjoy it:

Abstract:Recent works found that fine-tuning and joint training---two popular approaches for transfer learning---do not always improve accuracy on downstream tasks. First, we aim to understand more about when and why fine-tuning and joint training can be suboptimal or even harmful for transfer learning. We design semi-synthetic datasets where the source task can be solved by either source-specific features or transferable features. We observe that (1) pre-training may not have incentive to learn transferable features and (2) joint training may simultaneously learn source-specific features and overfit to the target. Second, to improve over fine-tuning and joint training, we propose Meta Representation Learning (MeRLin) to learn transferable features. MeRLin meta-learns representations by ensuring that a head fit on top of the representations with target training data also performs well on target validation data. We also prove that MeRLin recovers the target ground-truth model with a quadratic neural net parameterization and a source distribution that contains both transferable and source-specific features. On the same distribution, pre-training and joint training provably fail to learn transferable features. MeRLin empirically outperforms previous state-of-the-art transfer learning algorithms on various real-world vision and NLP transfer learning benchmarks.

View paper on

OpenReview

Share this with someone who'll enjoy it:

Title:Meta-learning Transferable Representations with a Single Target Domain

Paper and Code