CoRAG: Collaborative Retrieval-Augmented Generation

Muhamed, Aashiq; Diab, Mona; Smith, Virginia

Computer Science > Artificial Intelligence

arXiv:2504.01883 (cs)

[Submitted on 2 Apr 2025]

Title:CoRAG: Collaborative Retrieval-Augmented Generation

Authors:Aashiq Muhamed, Mona Diab, Virginia Smith

View PDF HTML (experimental)

Abstract:Retrieval-Augmented Generation (RAG) models excel in knowledge-intensive tasks, especially under few-shot learning constraints. We introduce CoRAG, a framework extending RAG to collaborative settings, where clients jointly train a shared model using a collaborative passage store. To evaluate CoRAG, we introduce CRAB, a benchmark for collaborative homogeneous open-domain question answering. Our experiments demonstrate that CoRAG consistently outperforms both parametric collaborative learning methods and locally trained RAG models in low-resource scenarios. Further analysis reveals the critical importance of relevant passages within the shared store, the surprising benefits of incorporating irrelevant passages, and the potential for hard negatives to negatively impact performance. This introduces a novel consideration in collaborative RAG: the trade-off between leveraging a collectively enriched knowledge base and the potential risk of incorporating detrimental passages from other clients. Our findings underscore the viability of CoRAG, while also highlighting key design challenges and promising avenues for future research.

Comments:	NAACL 2024
Subjects:	Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
Cite as:	arXiv:2504.01883 [cs.AI]
	(or arXiv:2504.01883v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2504.01883

Submission history

From: Aashiq Muhamed [view email]
[v1] Wed, 2 Apr 2025 16:40:43 UTC (101 KB)

Computer Science > Artificial Intelligence

Title:CoRAG: Collaborative Retrieval-Augmented Generation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:CoRAG: Collaborative Retrieval-Augmented Generation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators