Unique Rashomon Sets for Robust Active Learning

Nugyen, Simon; Hoffman, Kentaro; McCormick, Tyler

Statistics > Machine Learning

arXiv:2503.06770v1 (stat)

[Submitted on 9 Mar 2025 (this version), latest version 12 Mar 2025 (v2)]

Title:Unique Rashomon Sets for Robust Active Learning

Authors:Simon Nugyen, Kentaro Hoffman, Tyler McCormick

View PDF HTML (experimental)

Abstract:Collecting labeled data for machine learning models is often expensive and time-consuming. Active learning addresses this challenge by selectively labeling the most informative observations, but when initial labeled data is limited, it becomes difficult to distinguish genuinely informative points from those appearing uncertain primarily due to noise. Ensemble methods like random forests are a powerful approach to quantifying this uncertainty but do so by aggregating all models indiscriminately. This includes poor performing models and redundant models, a problem that worsens in the presence of noisy data. We introduce UNique Rashomon Ensembled Active Learning (UNREAL), which selectively ensembles only distinct models from the Rashomon set, which is the set of nearly optimal models. Restricting ensemble membership to high-performing models with different explanations helps distinguish genuine uncertainty from noise-induced variation. We show that UNREAL achieves faster theoretical convergence rates than traditional active learning approaches and demonstrates empirical improvements of up to 20% in predictive accuracy across five benchmark datasets, while simultaneously enhancing model interpretability.

Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as:	arXiv:2503.06770 [stat.ML]
	(or arXiv:2503.06770v1 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.2503.06770

Submission history

From: Kentaro Hoffman [view email]
[v1] Sun, 9 Mar 2025 20:50:34 UTC (3,347 KB)
[v2] Wed, 12 Mar 2025 01:53:55 UTC (3,347 KB)

Statistics > Machine Learning

Title:Unique Rashomon Sets for Robust Active Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:Unique Rashomon Sets for Robust Active Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators