RARR: Researching and Revising What Language Models Say, Using Language Models

Gao, Luyu; Dai, Zhuyun; Pasupat, Panupong; Chen, Anthony; Chaganty, Arun Tejasvi; Fan, Yicheng; Zhao, Vincent Y.; Lao, Ni; Lee, Hongrae; Juan, Da-Cheng; Guu, Kelvin

Computer Science > Computation and Language

arXiv:2210.08726 (cs)

[Submitted on 17 Oct 2022 (v1), last revised 31 May 2023 (this version, v3)]

Title:RARR: Researching and Revising What Language Models Say, Using Language Models

Authors:Luyu Gao, Zhuyun Dai, Panupong Pasupat, Anthony Chen, Arun Tejasvi Chaganty, Yicheng Fan, Vincent Y. Zhao, Ni Lao, Hongrae Lee, Da-Cheng Juan, Kelvin Guu

View PDF

Abstract:Language models (LMs) now excel at many tasks such as few-shot learning, question answering, reasoning, and dialog. However, they sometimes generate unsupported or misleading content. A user cannot easily determine whether their outputs are trustworthy or not, because most LMs do not have any built-in mechanism for attribution to external evidence. To enable attribution while still preserving all the powerful advantages of recent generation models, we propose RARR (Retrofit Attribution using Research and Revision), a system that 1) automatically finds attribution for the output of any text generation model and 2) post-edits the output to fix unsupported content while preserving the original output as much as possible. When applied to the output of several state-of-the-art LMs on a diverse set of generation tasks, we find that RARR significantly improves attribution while otherwise preserving the original input to a much greater degree than previously explored edit models. Furthermore, the implementation of RARR requires only a handful of training examples, a large language model, and standard web search.

Comments:	ACL 2023
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
Cite as:	arXiv:2210.08726 [cs.CL]
	(or arXiv:2210.08726v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2210.08726

Submission history

From: Panupong Pasupat [view email]
[v1] Mon, 17 Oct 2022 03:44:30 UTC (965 KB)
[v2] Sun, 4 Dec 2022 07:13:39 UTC (965 KB)
[v3] Wed, 31 May 2023 17:55:02 UTC (966 KB)

Computer Science > Computation and Language

Title:RARR: Researching and Revising What Language Models Say, Using Language Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:RARR: Researching and Revising What Language Models Say, Using Language Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators