CATfOOD: Counterfactual Augmented Training for Improving Out-of-Domain Performance and Calibration

Sachdeva, Rachneet; Tutek, Martin; Gurevych, Iryna

arXiv:2309.07822v1 (cs)

[Submitted on 14 Sep 2023 (this version), latest version 13 Feb 2024 (v3)]

Abstract: In recent years, large language models (LLMs) have shown remarkable capabilities at scale, particularly at generating text conditioned on a prompt. In our work, we investigate the use of LLMs to augment training data of small language models (SLMs) with automatically generated counterfactual (CF) instances (i.e., minimally altered inputs) in order to improve out-of-domain (OOD) performance of SLMs in the extractive question answering (QA) setup. We show that, across various LLM generators, such data augmentation consistently enhances OOD performance and improves model calibration for both confidence-based and rationale-augmented calibrator models. Furthermore, these performance improvements correlate with higher diversity of CF instances in terms of their surface form and semantic content. Finally, we show that CF augmented models which are easier to calibrate also exhibit much lower entropy when assigning importance, indicating that rationale-augmented calibrators prefer concise explanations.
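The abstract's final claim relates calibration to the entropy of importance assignments. The abstract does not say how this entropy is computed, so the following is only a minimal sketch under an assumed definition: absolute token-importance scores are normalized into a distribution and its Shannon entropy is taken. The function name `importance_entropy` and the example scores are hypothetical and not taken from the paper.

```python
import math


def importance_entropy(scores):
    """Shannon entropy (in nats) of importance scores.

    Assumption: absolute scores are normalized to sum to 1 before the
    entropy is computed. The paper may define this differently.
    """
    magnitudes = [abs(s) for s in scores]
    total = sum(magnitudes)
    if total == 0:
        raise ValueError("at least one importance score must be non-zero")
    probs = [m / total for m in magnitudes]
    # Zero-probability tokens contribute nothing to the entropy.
    return -sum(p * math.log(p) for p in probs if p > 0)


# Hypothetical explanations over four tokens.
concise = [0.90, 0.05, 0.03, 0.02]  # importance concentrated on one token
diffuse = [0.25, 0.25, 0.25, 0.25]  # importance spread evenly

print(importance_entropy(concise) < importance_entropy(diffuse))
```

Under this assumed definition, an explanation that concentrates importance on few tokens has lower entropy than one that spreads it evenly, which is one way to read "concise explanations" in the abstract.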

Comments:	We make our code available at: this https URL
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2309.07822 [cs.CL]
	(or arXiv:2309.07822v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2309.07822

Submission history

From: Rachneet Sachdeva
[v1] Thu, 14 Sep 2023 16:16:40 UTC (543 KB)
[v2] Fri, 15 Sep 2023 07:57:55 UTC (544 KB)
[v3] Tue, 13 Feb 2024 10:52:52 UTC (2,739 KB)
