Quantification under prior probability shift: the ratio estimator and its extensions

Vaz, Afonso Fernandes; Izbicki, Rafael; Stern, Rafael Bassi

arXiv:1807.03929 (stat)

[Submitted on 11 Jul 2018 (v1), last revised 5 Apr 2019 (this version, v2)]

Abstract: The quantification problem consists of determining the prevalence of a given label in a target population. However, one often has access to the labels in a sample from the training population but not in the target population. A common assumption in this situation is that of prior probability shift, that is, once the labels are known, the distribution of the features is the same in the training and target populations. In this paper, we derive a new lower bound for the risk of the quantification problem under the prior shift assumption. Complementing this lower bound, we present a new approximately minimax class of estimators, ratio estimators, which generalize several previous proposals in the literature. Using a weaker version of the prior shift assumption, which can be tested, we show that ratio estimators can be used to build confidence intervals for the quantification problem. We also extend the ratio estimator so that it can: (i) incorporate labels from the target population, when they are available, and (ii) estimate how the prevalence of positive labels varies according to a function of certain covariates.
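As an illustration of the idea behind ratio estimators, the following is a minimal sketch, not the authors' implementation. Under prior probability shift, for any function g of the features, the target mean of g(X) equals θ·E[g(X) | Y=1] + (1−θ)·E[g(X) | Y=0], where θ is the target prevalence and both conditional means can be estimated from the labeled training sample. Solving for θ gives a ratio of differences of sample means. The function name `ratio_estimator`, the simulated Gaussian data, the threshold choice of g, and the clipping to [0, 1] are all assumptions made for this example; the exact form used in the paper may differ.

```python
import random
from statistics import fmean


def ratio_estimator(g, x_train, y_train, x_target):
    """Estimate the prevalence of label 1 in the target sample.

    g is any real-valued function of the features (hypothetical choice
    left to the caller). Relies on the prior shift assumption: the
    distribution of x given y is the same in both populations.
    """
    # Class-conditional means of g, estimated from labeled training data.
    g0 = fmean(g(x) for x, y in zip(x_train, y_train) if y == 0)
    g1 = fmean(g(x) for x, y in zip(x_train, y_train) if y == 1)
    if g1 == g0:
        raise ValueError("g does not separate the two classes on average")
    # Mean of g over the unlabeled target sample.
    g_target = fmean(g(x) for x in x_target)
    theta = (g_target - g0) / (g1 - g0)
    # Clipping to [0, 1] is a convenience added for this sketch.
    return min(1.0, max(0.0, theta))


def simulate(n, prevalence, rng):
    """Draw labels with the given prevalence, then x | y ~ N(2y, 1)."""
    ys = [1 if rng.random() < prevalence else 0 for _ in range(n)]
    xs = [rng.gauss(2.0 if y == 1 else 0.0, 1.0) for y in ys]
    return xs, ys


rng = random.Random(0)
x_train, y_train = simulate(5000, 0.5, rng)   # training prevalence 0.5
x_target, _ = simulate(5000, 0.2, rng)        # target prevalence 0.2, labels unused


def g(x):
    # A hard classifier thresholded at 1.0; with this choice the sketch
    # reduces to an adjusted classify-and-count style estimate.
    return 1.0 if x > 1.0 else 0.0


estimate = ratio_estimator(g, x_train, y_train, x_target)
print(f"estimated target prevalence: {estimate:.3f}")
```

Any g whose class-conditional means differ can be plugged in; a g that separates the classes better makes the denominator larger and the estimate less noisy.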

Comments:	33 pages, 15 figures
Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
MSC classes:	62F12, 62G05, 62G08
Cite as:	arXiv:1807.03929 [stat.ML]
	(or arXiv:1807.03929v2 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.1807.03929

Submission history

From: Rafael Stern
[v1] Wed, 11 Jul 2018 02:19:57 UTC (1,420 KB)
[v2] Fri, 5 Apr 2019 17:18:03 UTC (2,277 KB)
