Improving uplift model evaluation on RCT data

Bokelmann, Björn; Lessmann, Stefan

arXiv:2210.02152 (stat)

[Submitted on 5 Oct 2022 (v1), last revised 16 Dec 2022 (this version, v3)]

Abstract: Estimating treatment effects is one of the most challenging and important tasks of data analysts. In many applications, like online marketing and personalized medicine, treatment needs to be allocated to the individuals where it yields a high positive treatment effect. Uplift models help select the right individuals for treatment and maximize the overall treatment effect (uplift). A major challenge in uplift modeling concerns model evaluation. Previous literature suggests methods like the Qini curve and the transformed outcome mean squared error. However, these metrics suffer from variance: their evaluations are strongly affected by random noise in the data, which renders their signals, to a certain degree, arbitrary. We theoretically analyze the variance of uplift evaluation metrics and derive possible methods of variance reduction, which are based on statistical adjustment of the outcome. We derive simple conditions under which the variance reduction methods improve the uplift evaluation metrics and empirically demonstrate their benefits on simulated and real-world data. Our paper provides strong evidence in favor of applying the suggested variance reduction procedures by default when evaluating uplift models on RCT data.
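The variance problem the abstract describes can be illustrated with a small simulation. The sketch below is not the paper's procedure or data. It assumes an RCT with known treatment probability p, uses the standard transformed outcome Y* = Y (W - p) / (p (1 - p)), and adjusts the outcome by subtracting a linear estimate of E[Y | X] fitted on a separate sample. The data-generating process and all names (simulate, transformed_outcome, m_hat) are hypothetical and chosen only for illustration.

```python
import numpy as np


def transformed_outcome(y, w, p):
    """Transformed outcome for an RCT with known treatment probability p.

    Its conditional expectation given X equals the treatment effect tau(X).
    """
    return y * (w - p) / (p * (1.0 - p))


def simulate(n, rng, p=0.5):
    """Hypothetical data: strong baseline effect, small heterogeneous uplift."""
    x = rng.normal(size=n)
    w = rng.binomial(1, p, size=n)   # randomized treatment assignment
    tau = 0.5 + 0.5 * x              # true individual treatment effect
    y = 5.0 * x + w * tau + rng.normal(size=n)
    return x, w, y, tau


rng = np.random.default_rng(0)
p = 0.5

# Separate sample used only to fit the adjustment model m(x) ~ E[Y | X = x].
x_fit, _, y_fit, _ = simulate(20_000, rng, p)
coef = np.polyfit(x_fit, y_fit, deg=1)

# Evaluation sample.
x, w, y, tau = simulate(20_000, rng, p)
m_hat = np.polyval(coef, x)

# Because W is independent of X in an RCT, subtracting m_hat leaves the
# expectation of the transformed outcome unchanged but removes baseline noise.
y_star_raw = transformed_outcome(y, w, p)
y_star_adj = transformed_outcome(y - m_hat, w, p)

var_raw = y_star_raw.var()
var_adj = y_star_adj.var()

print(f"mean raw: {y_star_raw.mean():.3f}, variance raw: {var_raw:.1f}")
print(f"mean adjusted: {y_star_adj.mean():.3f}, variance adjusted: {var_adj:.1f}")
```

In this toy setup both versions of the transformed outcome average to roughly the true mean effect of 0.5, while the adjusted version has far lower variance, so an evaluation metric built on it (for example a transformed outcome MSE) would be less affected by noise in the baseline outcome.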

Subjects:	Methodology (stat.ME); Machine Learning (stat.ML)
Cite as:	arXiv:2210.02152 [stat.ME]
	(or arXiv:2210.02152v3 [stat.ME] for this version)
	https://doi.org/10.48550/arXiv.2210.02152

Submission history

From: Björn Bokelmann
[v1] Wed, 5 Oct 2022 11:16:01 UTC (697 KB)
[v2] Sun, 13 Nov 2022 20:20:49 UTC (533 KB)
[v3] Fri, 16 Dec 2022 14:17:24 UTC (554 KB)
