Rainier: Reinforced Knowledge Introspector for Commonsense Question Answering

Liu, Jiacheng; Hallinan, Skyler; Lu, Ximing; He, Pengfei; Welleck, Sean; Hajishirzi, Hannaneh; Choi, Yejin

Computer Science > Computation and Language

arXiv:2210.03078 (cs)

[Submitted on 6 Oct 2022 (v1), last revised 22 Oct 2022 (this version, v2)]

Title:Rainier: Reinforced Knowledge Introspector for Commonsense Question Answering

Authors:Jiacheng Liu, Skyler Hallinan, Ximing Lu, Pengfei He, Sean Welleck, Hannaneh Hajishirzi, Yejin Choi

View PDF

Abstract:Knowledge underpins reasoning. Recent research demonstrates that when relevant knowledge is provided as additional context to commonsense question answering (QA), it can substantially enhance the performance even on top of state-of-the-art. The fundamental challenge is where and how to find such knowledge that is high quality and on point with respect to the question; knowledge retrieved from knowledge bases are incomplete and knowledge generated from language models are inconsistent. We present Rainier, or Reinforced Knowledge Introspector, that learns to generate contextually relevant knowledge in response to given questions. Our approach starts by imitating knowledge generated by GPT-3, then learns to generate its own knowledge via reinforcement learning where rewards are shaped based on the increased performance on the resulting question answering. Rainier demonstrates substantial and consistent performance gains when tested over 9 different commonsense benchmarks: including 5 datasets that are seen during model training, as well as 4 datasets that are kept unseen. Our work is the first to report that knowledge generated by models that are orders of magnitude smaller than GPT-3, even without direct supervision on the knowledge itself, can exceed the quality of commonsense knowledge elicited from GPT-3.

Comments:	EMNLP 2022 main conference
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2210.03078 [cs.CL]
	(or arXiv:2210.03078v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2210.03078

Submission history

From: Jiacheng Liu [view email]
[v1] Thu, 6 Oct 2022 17:34:06 UTC (1,664 KB)
[v2] Sat, 22 Oct 2022 04:45:48 UTC (3,143 KB)

Computer Science > Computation and Language

Title:Rainier: Reinforced Knowledge Introspector for Commonsense Question Answering

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Rainier: Reinforced Knowledge Introspector for Commonsense Question Answering

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators