Efficient Federated Search for Retrieval-Augmented Generation

Guerraoui, Rachid; Kermarrec, Anne-Marie; Petrescu, Diana; Pires, Rafael; Randl, Mathis; de Vos, Martijn

arXiv:2502.19280 (cs)

[Submitted on 26 Feb 2025]

Abstract: Large language models (LLMs) have demonstrated remarkable capabilities across various domains but remain susceptible to hallucinations and inconsistencies, limiting their reliability. Retrieval-augmented generation (RAG) mitigates these issues by grounding model responses in external knowledge sources. Existing RAG workflows often rely on a single vector database, which is impractical in the common setting where information is distributed across multiple repositories. We introduce RAGRoute, a novel mechanism for federated RAG search. RAGRoute dynamically selects relevant data sources at query time using a lightweight neural network classifier. By not querying every data source, this approach significantly reduces query overhead, improves retrieval efficiency, and minimizes the retrieval of irrelevant information. We evaluate RAGRoute using the MIRAGE and MMLU benchmarks and demonstrate its effectiveness in retrieving relevant documents while reducing the number of queries. RAGRoute reduces the total number of queries by up to 77.5% and the communication volume by up to 76.2%.
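The routing idea in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the `Source` class, the centroid feature, the fixed `weight` and `bias`, and the 0.5 threshold are hypothetical, and a single logistic unit stands in for the trained lightweight neural network classifier. The sketch only shows the control flow: score each data source for a query, skip the sources scored as irrelevant, and search the rest.

```python
import math
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass
class Source:
    """Hypothetical data source: a name, a summary embedding, and its documents."""
    name: str
    centroid: List[float]
    docs: Dict[str, List[float]]  # doc id -> embedding


def dot(a: List[float], b: List[float]) -> float:
    return sum(x * y for x, y in zip(a, b))


def relevance_probability(query: List[float], source: Source,
                          weight: float = 4.0, bias: float = -2.0) -> float:
    # Stand-in for a trained classifier: one logistic unit over a single
    # feature (query/centroid similarity). Weight and bias are made up.
    return 1.0 / (1.0 + math.exp(-(weight * dot(query, source.centroid) + bias)))


def route(query: List[float], sources: List[Source],
          threshold: float = 0.5) -> List[Source]:
    # Keep only the sources the classifier considers relevant to this query.
    return [s for s in sources if relevance_probability(query, s) >= threshold]


def federated_search(query: List[float], sources: List[Source], k: int = 2,
                     threshold: float = 0.5) -> Tuple[List[str], List[str]]:
    selected = route(query, sources, threshold)
    # Only the selected sources are queried; the others receive no request.
    hits = [(dot(query, emb), s.name, doc_id)
            for s in selected for doc_id, emb in s.docs.items()]
    hits.sort(reverse=True)  # highest similarity first
    return [s.name for s in selected], [doc_id for _, _, doc_id in hits[:k]]
```

With two sources whose centroids point in different directions, a query aligned with one centroid is sent to that source only, which is where the savings in queries and communication volume would come from. In a real system the classifier would be trained on which sources actually contributed relevant documents for past queries.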

Comments: To appear in the proceedings of EuroMLSys'25
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC); Information Retrieval (cs.IR)
Cite as: arXiv:2502.19280 [cs.LG] (or arXiv:2502.19280v1 [cs.LG] for this version)
DOI: https://doi.org/10.48550/arXiv.2502.19280

Submission history

From: Diana Petrescu
[v1] Wed, 26 Feb 2025 16:36:24 UTC (1,240 KB)
