Open-Domain Question Answering Goes Conversational via Question Rewriting

Anantha, Raviteja; Vakulenko, Svitlana; Tu, Zhucheng; Longpre, Shayne; Pulman, Stephen; Chappidi, Srinivas

Computer Science > Information Retrieval

arXiv:2010.04898v1 (cs)

[Submitted on 10 Oct 2020 (this version), latest version 14 Apr 2021 (v3)]

Title:Open-Domain Question Answering Goes Conversational via Question Rewriting

Authors:Raviteja Anantha, Svitlana Vakulenko, Zhucheng Tu, Shayne Longpre, Stephen Pulman, Srinivas Chappidi

View PDF

Abstract:We introduce a new dataset for Question Rewriting in Conversational Context (QReCC), which contains 14K conversations with 81K question-answer pairs. The task in QReCC is to find answers to conversational questions within a collection of 10M web pages (split into 54M passages). Answers to questions in the same conversation may be distributed across several web pages. QReCC provides annotations that allow us to train and evaluate individual subtasks of question rewriting, passage retrieval and reading comprehension required for the end-to-end conversational question answering (QA) task. We report the effectiveness of a strong baseline approach that combines the state-of-the-art model for question rewriting, and competitive models for open-domain QA. Our results set the first baseline for the QReCC dataset with F1 of 19.07, compared to the human upper bound of 74.47, indicating the difficulty of the setup and a large room for improvement.

Comments:	15 pages, 10 tables, 3 figures
Subjects:	Information Retrieval (cs.IR); Computation and Language (cs.CL)
Cite as:	arXiv:2010.04898 [cs.IR]
	(or arXiv:2010.04898v1 [cs.IR] for this version)
	https://doi.org/10.48550/arXiv.2010.04898

Submission history

From: Raviteja Anantha [view email]
[v1] Sat, 10 Oct 2020 04:28:42 UTC (2,689 KB)
[v2] Tue, 13 Apr 2021 08:01:39 UTC (9,027 KB)
[v3] Wed, 14 Apr 2021 19:09:19 UTC (9,028 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.IR

< prev | next >

new | recent | 2020-10

Change to browse by:

cs
cs.CL

References & Citations

DBLP - CS Bibliography

listing | bibtex

Svitlana Vakulenko
Zhucheng Tu
Shayne Longpre
Stephen Pulman

export BibTeX citation

Computer Science > Information Retrieval

Title:Open-Domain Question Answering Goes Conversational via Question Rewriting

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Information Retrieval

Title:Open-Domain Question Answering Goes Conversational via Question Rewriting

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators