Learning from Similar Linear Representations: Adaptivity, Minimaxity, and Robustness

Tian, Ye; Gu, Yuqi; Feng, Yang

arXiv:2303.17765v2 (stat)

[Submitted on 31 Mar 2023 (v1), revised 16 Jul 2023 (this version, v2), latest version 9 Aug 2024 (v3)]

Abstract: Representation multi-task learning (MTL) and transfer learning (TL) have achieved tremendous success in practice. However, the theoretical understanding of these methods is still lacking. Most existing theoretical works focus on cases where all tasks share the same representation, and claim that MTL and TL almost always improve performance. However, as the number of tasks grows, assuming all tasks share the same representation is unrealistic. Also, this does not always match empirical findings, which suggest that a shared representation may not necessarily improve single-task or target-only learning performance. In this paper, we aim to understand how to learn from tasks with similar but not exactly the same linear representations, while dealing with outlier tasks. With a known intrinsic dimension, we propose two algorithms that are adaptive to the similarity structure and robust to outlier tasks under both MTL and TL settings. Our algorithms outperform single-task or target-only learning when representations across tasks are sufficiently similar and the fraction of outlier tasks is small. Furthermore, they always perform no worse than single-task learning or target-only learning, even when the representations are dissimilar. We provide information-theoretic lower bounds to show that our algorithms are nearly minimax optimal in a large regime. We also propose an algorithm to adapt to the unknown intrinsic dimension. We conduct two simulation studies to verify our theoretical results.

Comments:	76 pages, 5 figures
Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as:	arXiv:2303.17765 [stat.ML]
	(or arXiv:2303.17765v2 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.2303.17765

Submission history

From: Ye Tian
[v1] Fri, 31 Mar 2023 01:56:13 UTC (3,088 KB)
[v2] Sun, 16 Jul 2023 06:30:55 UTC (2,264 KB)
[v3] Fri, 9 Aug 2024 03:26:15 UTC (2,719 KB)
