Auditing: Active Learning with Outcome-Dependent Query Costs

Sabato, Sivan; Sarwate, Anand D.; Srebro, Nathan

Computer Science > Machine Learning

arXiv:1306.2347 (cs)

[Submitted on 10 Jun 2013 (v1), last revised 12 Jul 2015 (this version, v4)]

Title:Auditing: Active Learning with Outcome-Dependent Query Costs

Authors:Sivan Sabato, Anand D. Sarwate, Nathan Srebro

View PDF

Abstract:We propose a learning setting in which unlabeled data is free, and the cost of a label depends on its value, which is not known in advance. We study binary classification in an extreme case, where the algorithm only pays for negative labels. Our motivation are applications such as fraud detection, in which investigating an honest transaction should be avoided if possible. We term the setting auditing, and consider the auditing complexity of an algorithm: the number of negative labels the algorithm requires in order to learn a hypothesis with low relative error. We design auditing algorithms for simple hypothesis classes (thresholds and rectangles), and show that with these algorithms, the auditing complexity can be significantly lower than the active label complexity. We also discuss a general competitive approach for auditing and possible modifications to the framework.

Comments:	Corrections in section 5
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:1306.2347 [cs.LG]
	(or arXiv:1306.2347v4 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1306.2347
Journal reference:	Neural Information Processing Systems 26 (NIPS), 512-520, 2013

Submission history

From: Sivan Sabato [view email]
[v1] Mon, 10 Jun 2013 20:18:48 UTC (122 KB)
[v2] Fri, 27 Sep 2013 17:57:33 UTC (122 KB)
[v3] Tue, 15 Oct 2013 18:27:07 UTC (120 KB)
[v4] Sun, 12 Jul 2015 10:11:57 UTC (120 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2013-06

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Sivan Sabato
Anand D. Sarwate
Nathan Srebro

export BibTeX citation

Computer Science > Machine Learning

Title:Auditing: Active Learning with Outcome-Dependent Query Costs

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Auditing: Active Learning with Outcome-Dependent Query Costs

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators