Explaining The Efficacy of Counterfactually-Augmented Data

Kaushik, Divyansh; Setlur, Amrith; Hovy, Eduard; Lipton, Zachary C.

Computer Science > Computation and Language

arXiv:2010.02114v1 (cs)

[Submitted on 5 Oct 2020 (this version), latest version 24 Mar 2021 (v4)]

Title:Explaining The Efficacy of Counterfactually-Augmented Data

Authors:Divyansh Kaushik, Amrith Setlur, Eduard Hovy, Zachary C. Lipton

View PDF

Abstract:In attempts to produce machine learning models less reliant on spurious patterns in training data, researchers have recently proposed a human-in-the-loop process for generating counterfactually augmented datasets. As applied in NLP, given some documents and their (initial) labels, humans are tasked with revising the text to make a (given) counterfactual label applicable. Importantly, the instructions prohibit edits that are not necessary to flip the applicable label. Models trained on the augmented (original and revised) data have been shown to rely less on semantically irrelevant words and to generalize better out-of-domain. While this work draws on causal thinking, casting edits as interventions and relying on human understanding to assess outcomes, the underlying causal model is not clear nor are the principles underlying the observed improvements in out-of-domain evaluation. In this paper, we explore a toy analog, using linear Gaussian models. Our analysis reveals interesting relationships between causal models, measurement noise, out-of-domain generalization, and reliance on spurious signals. Interestingly our analysis suggests that data corrupted by adding noise to causal features will degrade out-of-domain performance, while noise added to non-causal features may make models more robust out-of-domain. This analysis yields interesting insights that help to explain the efficacy of counterfactually augmented data. Finally, we present a large-scale empirical study that supports this hypothesis.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2010.02114 [cs.CL]
	(or arXiv:2010.02114v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2010.02114

Submission history

From: Divyansh Kaushik [view email]
[v1] Mon, 5 Oct 2020 15:57:07 UTC (2,359 KB)
[v2] Tue, 6 Oct 2020 02:21:13 UTC (2,359 KB)
[v3] Tue, 23 Mar 2021 02:02:40 UTC (7,427 KB)
[v4] Wed, 24 Mar 2021 01:46:15 UTC (7,427 KB)

Computer Science > Computation and Language

Title:Explaining The Efficacy of Counterfactually-Augmented Data

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Explaining The Efficacy of Counterfactually-Augmented Data

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators