Generalization Bounds via Conditional $f$-Information

Wang, Ziqiao; Mao, Yongyi


arXiv:2410.22887 (stat)

[Submitted on 30 Oct 2024]

Abstract: In this work, we introduce novel information-theoretic generalization bounds using the conditional $f$-information framework, an extension of the traditional conditional mutual information (MI) framework. We provide a generic approach to derive generalization bounds via $f$-information in the supersample setting, applicable to both bounded and unbounded loss functions. Unlike previous MI-based bounds, our proof strategy does not rely on upper bounding the cumulant-generating function (CGF) in the variational formula of MI. Instead, we set the CGF or its upper bound to zero by carefully selecting the measurable function invoked in the variational formula. Although some of our techniques are partially inspired by recent advances in the coin-betting framework (e.g., Jang et al. (2023)), our results are independent of any previous findings from regret guarantees of online gambling algorithms. Additionally, our newly derived MI-based bound recovers many previous results and improves our understanding of their potential limitations. Finally, we empirically compare various $f$-information measures for generalization, demonstrating the improvement of our new bounds over the previous bounds.
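
For readers unfamiliar with the variational formula mentioned in the abstract, the following is a sketch in standard textbook notation, not the paper's own statement. The symbols $P$, $Q$, $g$, and $f^*$ are assumptions introduced here for illustration. The Donsker-Varadhan representation of the KL divergence, which underlies MI, reads:

$$
\mathrm{D_{KL}}(P \,\|\, Q) = \sup_{g} \left\{ \mathbb{E}_{P}[g(X)] - \log \mathbb{E}_{Q}\!\left[e^{g(X)}\right] \right\}
$$

The term $\log \mathbb{E}_{Q}[e^{g(X)}]$ plays the role of the CGF. For any fixed measurable $g$ with $\log \mathbb{E}_{Q}[e^{g(X)}] \le 0$, the formula gives

$$
\mathbb{E}_{P}[g(X)] \le \mathrm{D_{KL}}(P \,\|\, Q) + \log \mathbb{E}_{Q}\!\left[e^{g(X)}\right] \le \mathrm{D_{KL}}(P \,\|\, Q),
$$

which appears to be the kind of step the abstract describes as setting the CGF or its upper bound to zero. The analogous representation for a general $f$-divergence, with $f^*$ the convex conjugate of $f$, is

$$
\mathrm{D}_{f}(P \,\|\, Q) = \sup_{g} \left\{ \mathbb{E}_{P}[g(X)] - \mathbb{E}_{Q}\!\left[f^{*}(g(X))\right] \right\}.
$$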

Comments:	Accepted at NeurIPS 2024
Subjects:	Machine Learning (stat.ML); Information Theory (cs.IT); Machine Learning (cs.LG)
Cite as:	arXiv:2410.22887 [stat.ML]
	(or arXiv:2410.22887v1 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.2410.22887

Submission history

From: Ziqiao Wang
[v1] Wed, 30 Oct 2024 10:33:07 UTC (131 KB)
