Title: Learning from Similar Linear Representations: Adaptivity, Minimaxity, and Robustness

Mar 31, 2023

Ye Tian, Yuqi Gu, Yang Feng

Abstract: Representation multi-task learning (MTL) and transfer learning (TL) have achieved tremendous success in practice. However, the theoretical understanding of these methods is still lacking. Most existing theoretical works focus on cases where all tasks share the same representation, and claim that MTL and TL almost always improve performance. As the number of tasks grows, however, assuming that all tasks share the same representation becomes unrealistic. This assumption also does not always match empirical findings, which suggest that a shared representation may not necessarily improve single-task or target-only learning performance. In this paper, we aim to understand how to learn from tasks with similar but not exactly the same linear representations, while dealing with outlier tasks. We propose two algorithms that are adaptive to the similarity structure and robust to outlier tasks under both MTL and TL settings. Our algorithms outperform single-task or target-only learning when representations across tasks are sufficiently similar and the fraction of outlier tasks is small. Furthermore, they always perform no worse than single-task learning or target-only learning, even when the representations are dissimilar. We provide information-theoretic lower bounds to show that our algorithms are nearly minimax optimal in a large regime.

* 60 pages, 5 figures
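
To make the phrase "similar but not exactly the same linear representations" concrete, here is a minimal NumPy sketch of one way such tasks could be simulated. It is an illustration only, not the authors' algorithm or their exact model: the function names, the Gaussian perturbation, the use of the spectral norm between projection matrices as a similarity measure, and all numeric settings are assumptions made for this example.

```python
import numpy as np


def orthonormal_basis(matrix):
    """Return an orthonormal basis for the column space of `matrix` (reduced QR)."""
    q, _ = np.linalg.qr(matrix)
    return q


def make_similar_representations(num_tasks, dim, rank, perturbation, rng):
    """Draw a central dim x rank basis and one perturbed basis per task.

    Hypothetical generator: `perturbation` scales Gaussian noise added to the
    central basis before re-orthonormalizing, so 0.0 means every task shares
    the same representation.
    """
    center = orthonormal_basis(rng.standard_normal((dim, rank)))
    reps = [
        orthonormal_basis(center + perturbation * rng.standard_normal((dim, rank)))
        for _ in range(num_tasks)
    ]
    return center, reps


def projection_distance(a, b):
    """Spectral norm of the difference between the projections onto span(a) and span(b)."""
    return np.linalg.norm(a @ a.T - b @ b.T, ord=2)


def simulate_task(rep, num_samples, noise_std, rng):
    """Linear regression task whose coefficient vector lies in the span of `rep`."""
    theta = rng.standard_normal(rep.shape[1])   # low-dimensional task parameter
    beta = rep @ theta                          # full coefficient vector
    x = rng.standard_normal((num_samples, rep.shape[0]))
    y = x @ beta + noise_std * rng.standard_normal(num_samples)
    return x, y, beta


rng = np.random.default_rng(0)
center, reps = make_similar_representations(
    num_tasks=5, dim=20, rank=3, perturbation=0.05, rng=rng
)
distances = [projection_distance(center, rep) for rep in reps]
tasks = [simulate_task(rep, num_samples=50, noise_std=0.1, rng=rng) for rep in reps]
```

In this sketch, small values in `distances` correspond to tasks whose representations are close to a common one, while an outlier task could be mimicked by drawing its basis independently of `center`.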
