The Effect of Data Poisoning on Counterfactual Explanations

Artelt, André; Sharma, Shubham; Lecué, Freddy; Hammer, Barbara

Computer Science > Machine Learning

arXiv:2402.08290 (cs)

[Submitted on 13 Feb 2024 (v1), last revised 21 May 2024 (this version, v3)]

Title:The Effect of Data Poisoning on Counterfactual Explanations

Authors:André Artelt, Shubham Sharma, Freddy Lecué, Barbara Hammer

View PDF HTML (experimental)

Abstract:Counterfactual explanations provide a popular method for analyzing the predictions of black-box systems, and they can offer the opportunity for computational recourse by suggesting actionable changes on how to change the input to obtain a different (i.e.\ more favorable) system output. However, recent work highlighted their vulnerability to different types of manipulations.
This work studies the vulnerability of counterfactual explanations to data poisoning. We formally introduce and investigate data poisoning in the context of counterfactual explanations for increasing the cost of recourse on three different levels: locally for a single instance, or a sub-group of instances, or globally for all instances. In this context, we characterize and prove the correctness of several different data poisonings. We also empirically demonstrate that state-of-the-art counterfactual generation methods and toolboxes are vulnerable to such data poisoning.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2402.08290 [cs.LG]
	(or arXiv:2402.08290v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2402.08290

Submission history

From: André Artelt [view email]
[v1] Tue, 13 Feb 2024 08:41:32 UTC (136 KB)
[v2] Thu, 2 May 2024 11:56:06 UTC (148 KB)
[v3] Tue, 21 May 2024 11:37:52 UTC (120 KB)

Computer Science > Machine Learning

Title:The Effect of Data Poisoning on Counterfactual Explanations

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:The Effect of Data Poisoning on Counterfactual Explanations

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators