Robust PAC$^m$: Training Ensemble Models Under Misspecification and Outliers

Zecchin, Matteo; Park, Sangwoo; Simeone, Osvaldo; Kountouris, Marios; Gesbert, David

Computer Science > Machine Learning

arXiv:2203.01859 (cs)

[Submitted on 3 Mar 2022 (v1), last revised 23 Apr 2023 (this version, v3)]

Title:Robust PAC$^m$: Training Ensemble Models Under Misspecification and Outliers

Authors:Matteo Zecchin, Sangwoo Park, Osvaldo Simeone, Marios Kountouris, David Gesbert

View PDF

Abstract:Standard Bayesian learning is known to have suboptimal generalization capabilities under misspecification and in the presence of outliers. PAC-Bayes theory demonstrates that the free energy criterion minimized by Bayesian learning is a bound on the generalization error for Gibbs predictors (i.e., for single models drawn at random from the posterior) under the assumption of sampling distributions uncontaminated by outliers. This viewpoint provides a justification for the limitations of Bayesian learning when the model is misspecified, requiring ensembling, and when data is affected by outliers. In recent work, PAC-Bayes bounds -- referred to as PAC$^m$ -- were derived to introduce free energy metrics that account for the performance of ensemble predictors, obtaining enhanced performance under misspecification. This work presents a novel robust free energy criterion that combines the generalized logarithm score function with PAC$^m$ ensemble bounds. The proposed free energy training criterion produces predictive distributions that are able to concurrently counteract the detrimental effects of misspecification -- with respect to both likelihood and prior distribution -- and outliers.

Subjects:	Machine Learning (cs.LG); Information Theory (cs.IT); Machine Learning (stat.ML)
Cite as:	arXiv:2203.01859 [cs.LG]
	(or arXiv:2203.01859v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2203.01859

Submission history

From: Matteo Zecchin [view email]
[v1] Thu, 3 Mar 2022 17:11:07 UTC (7,208 KB)
[v2] Tue, 28 Mar 2023 16:34:26 UTC (8,007 KB)
[v3] Sun, 23 Apr 2023 15:12:44 UTC (8,193 KB)

Computer Science > Machine Learning

Title:Robust PAC$^m$: Training Ensemble Models Under Misspecification and Outliers

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Robust PAC$^m$: Training Ensemble Models Under Misspecification and Outliers

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators