Instance-Dependent Generalization Bounds via Optimal Transport

Hou, Songyan; Kassraie, Parnian; Kratsios, Anastasis; Rothfuss, Jonas; Krause, Andreas

Statistics > Machine Learning

arXiv:2211.01258v2 (stat)

[Submitted on 2 Nov 2022 (v1), revised 7 Nov 2022 (this version, v2), latest version 13 Nov 2023 (v4)]

Title:Instance-Dependent Generalization Bounds via Optimal Transport

Authors:Songyan Hou, Parnian Kassraie, Anastasis Kratsios, Jonas Rothfuss, Andreas Krause

View PDF

Abstract:Existing generalization bounds fail to explain crucial factors that drive generalization of modern neural networks. Since such bounds often hold uniformly over all parameters, they suffer from over-parametrization, and fail to account for the strong inductive bias of initialization and stochastic gradient descent. As an alternative, we propose a novel optimal transport interpretation of the generalization problem. This allows us to derive instance-dependent generalization bounds that depend on the local Lipschitz regularity of the earned prediction function in the data space. Therefore, our bounds are agnostic to the parametrization of the model and work well when the number of training samples is much smaller than the number of parameters. With small modifications, our approach yields accelerated rates for data on low-dimensional manifolds, and guarantees under distribution shifts. We empirically analyze our generalization bounds for neural networks, showing that the bound values are meaningful and capture the effect of popular regularization methods during training.

Comments:	50 pages, 7 figures
Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as:	arXiv:2211.01258 [stat.ML]
	(or arXiv:2211.01258v2 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.2211.01258

Submission history

From: Parnian Kassraie [view email]
[v1] Wed, 2 Nov 2022 16:39:42 UTC (4,078 KB)
[v2] Mon, 7 Nov 2022 17:07:31 UTC (4,079 KB)
[v3] Wed, 2 Aug 2023 13:52:37 UTC (322 KB)
[v4] Mon, 13 Nov 2023 13:37:52 UTC (2,083 KB)

Statistics > Machine Learning

Title:Instance-Dependent Generalization Bounds via Optimal Transport

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:Instance-Dependent Generalization Bounds via Optimal Transport

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators