Post-Selection Confidence Bounds for Prediction Performance

Rink, Pascal; Brannath, Werner

Statistics > Machine Learning

arXiv:2210.13206 (stat)

[Submitted on 24 Oct 2022 (v1), last revised 3 Feb 2023 (this version, v3)]

Title:Post-Selection Confidence Bounds for Prediction Performance

Authors:Pascal Rink, Werner Brannath

View PDF

Abstract:In machine learning, the selection of a promising model from a potentially large number of competing models and the assessment of its generalization performance are critical tasks that need careful consideration. Typically, model selection and evaluation are strictly separated endeavors, splitting the sample at hand into a training, validation, and evaluation set, and only compute a single confidence interval for the prediction performance of the final selected model. We however propose an algorithm how to compute valid lower confidence bounds for multiple models that have been selected based on their prediction performances in the evaluation set by interpreting the selection problem as a simultaneous inference problem. We use bootstrap tilting and a maxT-type multiplicity correction. The approach is universally applicable for any combination of prediction models, any model selection strategy, and any prediction performance measure that accepts weights. We conducted various simulation experiments which show that our proposed approach yields lower confidence bounds that are at least comparably good as bounds from standard approaches, and that reliably reach the nominal coverage probability. In addition, especially when sample size is small, our proposed approach yields better performing prediction models than the default selection of only one model for evaluation does.

Comments:	17 pages, 13 figures, 3 tables. Submitted to the Springer Machine Learning Journal. Changes to version 2: made figures easier to read; corrected a minor typo
Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as:	arXiv:2210.13206 [stat.ML]
	(or arXiv:2210.13206v3 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.2210.13206

Submission history

From: Pascal Rink [view email]
[v1] Mon, 24 Oct 2022 13:28:43 UTC (2,872 KB)
[v2] Thu, 27 Oct 2022 11:32:11 UTC (1,789 KB)
[v3] Fri, 3 Feb 2023 09:02:12 UTC (3,218 KB)

Statistics > Machine Learning

Title:Post-Selection Confidence Bounds for Prediction Performance

Submission history

Access Paper:

Ancillary files (details):

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:Post-Selection Confidence Bounds for Prediction Performance

Submission history

Access Paper:

Ancillary files (details):

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators