Knowledge-driven Data Construction for Zero-shot Evaluation in Commonsense Question Answering

Ma, Kaixin; Ilievski, Filip; Francis, Jonathan; Bisk, Yonatan; Nyberg, Eric; Oltramari, Alessandro

Computer Science > Computation and Language

arXiv:2011.03863 (cs)

[Submitted on 7 Nov 2020 (v1), last revised 14 Dec 2020 (this version, v2)]

Title:Knowledge-driven Data Construction for Zero-shot Evaluation in Commonsense Question Answering

Authors:Kaixin Ma, Filip Ilievski, Jonathan Francis, Yonatan Bisk, Eric Nyberg, Alessandro Oltramari

View PDF

Abstract:Recent developments in pre-trained neural language modeling have led to leaps in accuracy on commonsense question-answering benchmarks. However, there is increasing concern that models overfit to specific tasks, without learning to utilize external knowledge or perform general semantic reasoning. In contrast, zero-shot evaluations have shown promise as a more robust measure of a model's general reasoning abilities. In this paper, we propose a novel neuro-symbolic framework for zero-shot question answering across commonsense tasks. Guided by a set of hypotheses, the framework studies how to transform various pre-existing knowledge resources into a form that is most effective for pre-training models. We vary the set of language models, training regimes, knowledge sources, and data generation strategies, and measure their impact across tasks. Extending on prior work, we devise and compare four constrained distractor-sampling strategies. We provide empirical results across five commonsense question-answering tasks with data generated from five external knowledge resources. We show that, while an individual knowledge graph is better suited for specific tasks, a global knowledge graph brings consistent gains across different tasks. In addition, both preserving the structure of the task as well as generating fair and informative questions help language models learn more effectively.

Comments:	AAAI 2021
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2011.03863 [cs.CL]
	(or arXiv:2011.03863v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2011.03863

Submission history

From: Kaixin Ma [view email]
[v1] Sat, 7 Nov 2020 22:52:21 UTC (705 KB)
[v2] Mon, 14 Dec 2020 22:27:10 UTC (711 KB)

Computer Science > Computation and Language

Title:Knowledge-driven Data Construction for Zero-shot Evaluation in Commonsense Question Answering

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Knowledge-driven Data Construction for Zero-shot Evaluation in Commonsense Question Answering

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators