The Cost of Replicability in Active Learning

Hira, Rupkatha; Kau, Dominik; Sorrell, Jessica

Computer Science > Machine Learning

arXiv:2412.09686 (cs)

[Submitted on 12 Dec 2024]

Title:The Cost of Replicability in Active Learning

Authors:Rupkatha Hira, Dominik Kau, Jessica Sorrell

View PDF HTML (experimental)

Abstract:Active learning aims to reduce the required number of labeled data for machine learning algorithms by selectively querying the labels of initially unlabeled data points. Ensuring the replicability of results, where an algorithm consistently produces the same outcome across different runs, is essential for the reliability of machine learning models but often increases sample complexity. This report investigates the cost of replicability in active learning using the CAL algorithm, a classical disagreement-based active learning method. By integrating replicable statistical query subroutines and random thresholding techniques, we propose two versions of a replicable CAL algorithm. Our theoretical analysis demonstrates that while replicability does increase label complexity, the CAL algorithm can still achieve significant savings in label complexity even with the replicability constraint. These findings offer valuable insights into balancing efficiency and robustness in machine learning models.

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2412.09686 [cs.LG]
	(or arXiv:2412.09686v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2412.09686

Submission history

From: Rupkatha Hira [view email]
[v1] Thu, 12 Dec 2024 19:03:31 UTC (27 KB)

Computer Science > Machine Learning

Title:The Cost of Replicability in Active Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:The Cost of Replicability in Active Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators