Better RAG using Relevant Information Gain

Pickett, Marc; Hartman, Jeremy; Bhowmick, Ayan Kumar; Alam, Raquib-ul; Vempaty, Aditya

Computer Science > Computation and Language

arXiv:2407.12101 (cs)

[Submitted on 16 Jul 2024]

Title:Better RAG using Relevant Information Gain

Authors:Marc Pickett, Jeremy Hartman, Ayan Kumar Bhowmick, Raquib-ul Alam, Aditya Vempaty

View PDF HTML (experimental)

Abstract:A common way to extend the memory of large language models (LLMs) is by retrieval augmented generation (RAG), which inserts text retrieved from a larger memory into an LLM's context window. However, the context window is typically limited to several thousand tokens, which limits the number of retrieved passages that can inform a model's response. For this reason, it's important to avoid occupying context window space with redundant information by ensuring a degree of diversity among retrieved passages. At the same time, the information should also be relevant to the current task. Most prior methods that encourage diversity among retrieved results, such as Maximal Marginal Relevance (MMR), do so by incorporating an objective that explicitly trades off diversity and relevance. We propose a novel simple optimization metric based on relevant information gain, a probabilistic measure of the total information relevant to a query for a set of retrieved results. By optimizing this metric, diversity organically emerges from our system. When used as a drop-in replacement for the retrieval component of a RAG system, this method yields state-of-the-art performance on question answering tasks from the Retrieval Augmented Generation Benchmark (RGB), outperforming existing metrics that directly optimize for relevance and diversity.

Comments:	4 page paper submitted to EMNLP
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2407.12101 [cs.CL]
	(or arXiv:2407.12101v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2407.12101

Submission history

From: Marc Pickett [view email]
[v1] Tue, 16 Jul 2024 18:09:21 UTC (710 KB)

Computer Science > Computation and Language

Title:Better RAG using Relevant Information Gain

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Better RAG using Relevant Information Gain

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators