Can Explanations Be Useful for Calibrating Black Box Models?

Ye, Xi; Durrett, Greg

Computer Science > Computation and Language

arXiv:2110.07586v1 (cs)

[Submitted on 14 Oct 2021 (this version), latest version 15 Mar 2022 (v2)]

Title:Can Explanations Be Useful for Calibrating Black Box Models?

Authors:Xi Ye, Greg Durrett

View PDF

Abstract:One often wants to take an existing, trained NLP model and use it on data from a new domain. While fine-tuning or few-shot learning can be used to adapt the base model, there is no one simple recipe to getting these working; moreover, one may not have access to the original model weights if it is deployed as a black box. To this end, we study how to improve a black box model's performance on a new domain given examples from the new domain by leveraging explanations of the model's behavior. Our approach first extracts a set of features combining human intuition about the task with model attributions generated by black box interpretation techniques, and then uses a simple model to calibrate or rerank the model's predictions based on the features. We experiment with our method on two tasks, extractive question answering and natural language inference, covering adaptation from several pairs of domains. The experimental results across all the domain pairs show that explanations are useful for calibrating these models. We show that the calibration features transfer to some extent between tasks and shed light on how to effectively use them.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2110.07586 [cs.CL]
	(or arXiv:2110.07586v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2110.07586

Submission history

From: Xi Ye [view email]
[v1] Thu, 14 Oct 2021 17:48:16 UTC (385 KB)
[v2] Tue, 15 Mar 2022 03:51:13 UTC (160 KB)

Computer Science > Computation and Language

Title:Can Explanations Be Useful for Calibrating Black Box Models?

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Can Explanations Be Useful for Calibrating Black Box Models?

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators