Improving uplift model evaluation on RCT data

Bokelmann, Björn; Lessmann, Stefan

Statistics > Methodology

arXiv:2210.02152v1 (stat)

[Submitted on 5 Oct 2022 (this version), latest version 16 Dec 2022 (v3)]

Title:Improving uplift model evaluation on RCT data

Authors:Björn Bokelmann, Stefan Lessmann

View PDF

Abstract:Estimating treatment effects is one of the most challenging and important tasks of data analysts. Traditional statistical methods aim to estimate average treatment effects over a population. While being highly useful, such average treatment effects do not help to decide which individuals profit most by the treatment. This is where uplift modeling becomes important. Uplift models help to select the right individuals for treatment, to maximize the overall treatment effect (uplift). A challenging problem in uplift modeling is to evaluate the models. Previous literature suggests methods like the Qini curve and the transformed outcome mean squared error. However, these metrics suffer from variance: Their evaluations are strongly affected by random noise in the data, which makes these evaluations to a certain degree arbitrary. In this paper, we analyze the variance of the uplift evaluation metrics, on randomized controlled trial data, in a sound statistical manner. We propose certain outcome adjustment methods, for which we prove theoretically and empirically, that they reduce the variance of the uplift evaluation metrics. Our statistical analysis and the proposed outcome adjustment methods are a step towards a better evaluation practice in uplift modeling.

Subjects:	Methodology (stat.ME); Machine Learning (stat.ML)
Cite as:	arXiv:2210.02152 [stat.ME]
	(or arXiv:2210.02152v1 [stat.ME] for this version)
	https://doi.org/10.48550/arXiv.2210.02152

Submission history

From: Björn Bokelmann [view email]
[v1] Wed, 5 Oct 2022 11:16:01 UTC (697 KB)
[v2] Sun, 13 Nov 2022 20:20:49 UTC (533 KB)
[v3] Fri, 16 Dec 2022 14:17:24 UTC (554 KB)

Statistics > Methodology

Title:Improving uplift model evaluation on RCT data

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Methodology

Title:Improving uplift model evaluation on RCT data

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators