Bias Challenges in Counterfactual Data Augmentation

Mouli, S Chandra; Zhou, Yangze; Ribeiro, Bruno

Computer Science > Machine Learning

arXiv:2209.05104 (cs)

[Submitted on 12 Sep 2022 (v1), last revised 13 Sep 2022 (this version, v2)]

Abstract: Deep learning models tend not to be out-of-distribution robust primarily due to their reliance on spurious features to solve the task. Counterfactual data augmentations provide a general way of (approximately) achieving representations that are counterfactual-invariant to spurious features, a requirement for out-of-distribution (OOD) robustness. In this work, we show that counterfactual data augmentations may not achieve the desired counterfactual-invariance if the augmentation is performed by a context-guessing machine, an abstract machine that guesses the most-likely context of a given input. We theoretically analyze the invariance imposed by such counterfactual data augmentations and describe an exemplar NLP task where counterfactual data augmentation by a context-guessing machine does not lead to robust OOD classifiers.
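
The failure mode named in the abstract can be illustrated with a small toy sketch. This is not the authors' construction or their NLP task: the data, the context labels "A" and "B", and the helper names `guess_context`, `flip`, and `augment` are all hypothetical, and the guesser is assumed to be a simple frequency count. The sketch only shows that when an augmenter guesses the most likely context of an input, an input drawn from the less likely context can receive a "counterfactual" that lands back in its own true context.

```python
from collections import Counter

# Hypothetical toy data: (true_context, observed_token) pairs.
# The token "ambiguous" occurs in both contexts, more often in context "A".
samples = (
    [("A", "ambiguous")] * 3
    + [("B", "ambiguous")] * 1
    + [("B", "clear_b")] * 2
)


def guess_context(token, data):
    """Return the context most frequently seen with `token` in `data`."""
    counts = Counter(ctx for ctx, tok in data if tok == token)
    return counts.most_common(1)[0][0]


def flip(context):
    """Swap between the two toy contexts."""
    return "B" if context == "A" else "A"


def augment(data):
    """Pair each sample with the context its generated counterfactual targets.

    The augmenter never sees the true context; it flips its own guess.
    """
    rows = []
    for true_ctx, tok in data:
        guessed = guess_context(tok, data)
        rows.append((true_ctx, tok, flip(guessed)))
    return rows


augmented = augment(samples)

# Samples whose "counterfactual" context equals their true context:
# for these, the augmentation did not actually change the context.
unchanged = [row for row in augmented if row[0] == row[2]]
print(unchanged)
```

In this toy setup the single "ambiguous" sample from context "B" is guessed to be in context "A", so its counterfactual is generated for context "B", which is the context it was already in. Whether and how this carries over to a real classifier is what the paper analyzes; the sketch makes no claim about that.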

Comments:	Accepted at UAI 2022 Workshop on Causal Representation Learning
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2209.05104 [cs.LG]
	(or arXiv:2209.05104v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2209.05104

Submission history

From: Yangze Zhou
[v1] Mon, 12 Sep 2022 09:17:49 UTC (135 KB)
[v2] Tue, 13 Sep 2022 19:37:26 UTC (135 KB)
