Context Awareness Gate For Retrieval Augmented Generation

Heydari, Mohammad Hassan; Hemmat, Arshia; Naman, Erfan; Fatemi, Afsaneh

arXiv:2411.16133 (cs)

[Submitted on 25 Nov 2024 (v1), last revised 6 Jan 2025 (this version, v2)]

Abstract: Retrieval Augmented Generation (RAG) has emerged as a widely adopted approach to mitigate the limitations of large language models (LLMs) in answering domain-specific questions. Previous research has predominantly focused on improving the accuracy and quality of retrieved data chunks to enhance the overall performance of the generation pipeline. However, despite ongoing advancements, the critical issue of retrieving irrelevant information -- which can impair the ability of the model to utilize its internal knowledge effectively -- has received minimal attention. In this work, we investigate the impact of retrieving irrelevant information in open-domain question answering, highlighting its significant detrimental effect on the quality of LLM outputs. To address this challenge, we propose the Context Awareness Gate (CAG) architecture, a novel mechanism that dynamically adjusts the LLM's input prompt based on whether the user query necessitates external context retrieval. Additionally, we introduce the Vector Candidates method, a core mathematical component of CAG that is statistical, LLM-independent, and highly scalable. We further examine the distributions of relationships between contexts and questions, presenting a statistical analysis of these distributions. This analysis can be leveraged to enhance the context retrieval process in RAG systems.
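The gating idea described in the abstract can be illustrated with a minimal sketch. This is not the paper's Vector Candidates method, whose details the abstract does not give; it assumes a simple stand-in in which the query embedding is compared by cosine similarity against a set of candidate vectors representing the indexed corpus, and retrieval is skipped when the best match falls below a threshold. The function names, the `threshold` value, the `retrieve` callable, and the prompt layout are all hypothetical.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def needs_retrieval(query_vec, candidate_vecs, threshold=0.5):
    """Hypothetical gate: retrieve only if some candidate is similar enough."""
    best = max((cosine(query_vec, c) for c in candidate_vecs), default=0.0)
    return best >= threshold


def build_prompt(question, query_vec, candidate_vecs, retrieve, threshold=0.5):
    """Adjust the prompt depending on the gate's decision.

    `retrieve` is an assumed callable mapping a question to a list of text chunks.
    """
    if needs_retrieval(query_vec, candidate_vecs, threshold):
        context = "\n".join(retrieve(question))
        return f"Context:\n{context}\n\nQuestion: {question}"
    # Gate closed: let the model answer from its internal knowledge.
    return f"Question: {question}"
```

In this sketch the gate is independent of any LLM call, which mirrors the abstract's description of a statistical, LLM-independent component; a real system would calibrate the threshold from the similarity distributions the paper analyzes rather than fixing it by hand.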

Subjects:	Machine Learning (cs.LG); Information Retrieval (cs.IR)
Cite as:	arXiv:2411.16133 [cs.LG]
	(or arXiv:2411.16133v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2411.16133

Submission history

From: Arshia Hemmat
[v1] Mon, 25 Nov 2024 06:48:38 UTC (140 KB)
[v2] Mon, 6 Jan 2025 18:23:41 UTC (141 KB)
