Empirical Comparison between Cross-Validation and Mutation-Validation in Model Selection

Yu, Jinyang; Hamdan, Sami; Sasse, Leonard; Morrison, Abigail; Patil, Kaustubh R.

Computer Science > Machine Learning

arXiv:2311.14079 (cs)

[Submitted on 23 Nov 2023 (v1), last revised 15 Feb 2024 (this version, v2)]

Title:Empirical Comparison between Cross-Validation and Mutation-Validation in Model Selection

Authors:Jinyang Yu, Sami Hamdan, Leonard Sasse, Abigail Morrison, Kaustubh R. Patil

View PDF

Abstract:Mutation validation (MV) is a recently proposed approach for model selection, garnering significant interest due to its unique characteristics and potential benefits compared to the widely used cross-validation (CV) method. In this study, we empirically compared MV and $k$-fold CV using benchmark and real-world datasets. By employing Bayesian tests, we compared generalization estimates yielding three posterior probabilities: practical equivalence, CV superiority, and MV superiority. We also evaluated the differences in the capacity of the selected models and computational efficiency. We found that both MV and CV select models with practically equivalent generalization performance across various machine learning algorithms and the majority of benchmark datasets. MV exhibited advantages in terms of selecting simpler models and lower computational costs. However, in some cases MV selected overly simplistic models leading to underfitting and showed instability in hyperparameter selection. These limitations of MV became more evident in the evaluation of a real-world neuroscientific task of predicting sex at birth using brain functional connectivity.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2311.14079 [cs.LG]
	(or arXiv:2311.14079v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2311.14079

Submission history

From: Jinyang Yu [view email]
[v1] Thu, 23 Nov 2023 16:14:24 UTC (8,712 KB)
[v2] Thu, 15 Feb 2024 16:28:57 UTC (11,247 KB)

Computer Science > Machine Learning

Title:Empirical Comparison between Cross-Validation and Mutation-Validation in Model Selection

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Empirical Comparison between Cross-Validation and Mutation-Validation in Model Selection

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators