DEXTER: A Benchmark for open-domain Complex Question Answering using LLMs

Prabhu, Venktesh V. Deepali; Anand, Avishek

Computer Science > Computation and Language

arXiv:2406.17158 (cs)

[Submitted on 24 Jun 2024]

Title:DEXTER: A Benchmark for open-domain Complex Question Answering using LLMs

Authors:Venktesh V. Deepali Prabhu, Avishek Anand

View PDF HTML (experimental)

Abstract:Open-domain complex Question Answering (QA) is a difficult task with challenges in evidence retrieval and reasoning. The complexity of such questions could stem from questions being compositional, hybrid evidence, or ambiguity in questions. While retrieval performance for classical QA tasks is well explored, their capabilities for heterogeneous complex retrieval tasks, especially in an open-domain setting, and the impact on downstream QA performance, are relatively unexplored. To address this, in this work, we propose a benchmark composing diverse complex QA tasks and provide a toolkit to evaluate state-of-the-art pre-trained dense and sparse retrieval models in an open-domain setting. We observe that late interaction models and surprisingly lexical models like BM25 perform well compared to other pre-trained dense retrieval models. In addition, since context-based reasoning is critical for solving complex QA tasks, we also evaluate the reasoning capabilities of LLMs and the impact of retrieval performance on their reasoning capabilities. Through experiments, we observe that much progress is to be made in retrieval for complex QA to improve downstream QA performance. Our software and related data can be accessed at this https URL

Comments:	under submission, 22 pages
Subjects:	Computation and Language (cs.CL); Information Retrieval (cs.IR)
Cite as:	arXiv:2406.17158 [cs.CL]
	(or arXiv:2406.17158v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2406.17158

Submission history

From: Venktesh V [view email]
[v1] Mon, 24 Jun 2024 22:09:50 UTC (359 KB)

Computer Science > Computation and Language

Title:DEXTER: A Benchmark for open-domain Complex Question Answering using LLMs

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:DEXTER: A Benchmark for open-domain Complex Question Answering using LLMs

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators