Optimal Multi-Distribution Learning

Zhang, Zihan; Zhan, Wenhao; Chen, Yuxin; Du, Simon S.; Lee, Jason D.

Computer Science > Machine Learning

arXiv:2312.05134v4 (cs)

[Submitted on 8 Dec 2023 (v1), last revised 23 May 2024 (this version, v4)]

Title:Optimal Multi-Distribution Learning

Authors:Zihan Zhang, Wenhao Zhan, Yuxin Chen, Simon S. Du, Jason D. Lee

View PDF HTML (experimental)

Abstract:Multi-distribution learning (MDL), which seeks to learn a shared model that minimizes the worst-case risk across $k$ distinct data distributions, has emerged as a unified framework in response to the evolving demand for robustness, fairness, multi-group collaboration, etc. Achieving data-efficient MDL necessitates adaptive sampling, also called on-demand sampling, throughout the learning process. However, there exist substantial gaps between the state-of-the-art upper and lower bounds on the optimal sample complexity. Focusing on a hypothesis class of Vapnik-Chervonenkis (VC) dimension d, we propose a novel algorithm that yields an varepsilon-optimal randomized hypothesis with a sample complexity on the order of (d+k)/varepsilon^2 (modulo some logarithmic factor), matching the best-known lower bound. Our algorithmic ideas and theory are further extended to accommodate Rademacher classes. The proposed algorithms are oracle-efficient, which access the hypothesis class solely through an empirical risk minimization oracle.
Additionally, we establish the necessity of randomization, revealing a large sample size barrier when only deterministic hypotheses are permitted. These findings resolve three open problems presented in COLT 2023 (i.e., citet[Problems 1, 3 and 4]{awasthi2023sample}).

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2312.05134 [cs.LG]
	(or arXiv:2312.05134v4 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2312.05134

Submission history

From: Zihan Zhang [view email]
[v1] Fri, 8 Dec 2023 16:06:29 UTC (1,555 KB)
[v2] Sat, 20 Jan 2024 17:04:34 UTC (1,476 KB)
[v3] Wed, 15 May 2024 07:29:44 UTC (1,526 KB)
[v4] Thu, 23 May 2024 16:28:21 UTC (1,526 KB)

Computer Science > Machine Learning

Title:Optimal Multi-Distribution Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Optimal Multi-Distribution Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators