Multi-Mention Learning for Reading Comprehension with Neural Cascades

Swayamdipta, Swabha; Parikh, Ankur P.; Kwiatkowski, Tom

Computer Science > Computation and Language

arXiv:1711.00894 (cs)

[Submitted on 2 Nov 2017 (v1), last revised 30 May 2018 (this version, v2)]

Title:Multi-Mention Learning for Reading Comprehension with Neural Cascades

Authors:Swabha Swayamdipta, Ankur P. Parikh, Tom Kwiatkowski

View PDF

Abstract:Reading comprehension is a challenging task, especially when executed across longer or across multiple evidence documents, where the answer is likely to reoccur. Existing neural architectures typically do not scale to the entire evidence, and hence, resort to selecting a single passage in the document (either via truncation or other means), and carefully searching for the answer within that passage. However, in some cases, this strategy can be suboptimal, since by focusing on a specific passage, it becomes difficult to leverage multiple mentions of the same answer throughout the document. In this work, we take a different approach by constructing lightweight models that are combined in a cascade to find the answer. Each submodel consists only of feed-forward networks equipped with an attention mechanism, making it trivially parallelizable. We show that our approach can scale to approximately an order of magnitude larger evidence documents and can aggregate information at the representation level from multiple mentions of each answer candidate across the document. Empirically, our approach achieves state-of-the-art performance on both the Wikipedia and web domains of the TriviaQA dataset, outperforming more complex, recurrent architectures.

Comments:	Proceedings of ICLR 2018
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:1711.00894 [cs.CL]
	(or arXiv:1711.00894v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1711.00894

Submission history

From: Swabha Swayamdipta [view email]
[v1] Thu, 2 Nov 2017 19:13:55 UTC (62 KB)
[v2] Wed, 30 May 2018 23:27:02 UTC (854 KB)

Computer Science > Computation and Language

Title:Multi-Mention Learning for Reading Comprehension with Neural Cascades

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Multi-Mention Learning for Reading Comprehension with Neural Cascades

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators