RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs

Yu, Yue; Ping, Wei; Liu, Zihan; Wang, Boxin; You, Jiaxuan; Zhang, Chao; Shoeybi, Mohammad; Catanzaro, Bryan

Computer Science > Computation and Language

arXiv:2407.02485 (cs)

[Submitted on 2 Jul 2024]

Abstract: Large language models (LLMs) typically utilize the top-k contexts from a retriever in retrieval-augmented generation (RAG). In this work, we propose a novel instruction fine-tuning framework, RankRAG, which instruction-tunes a single LLM for the dual purpose of context ranking and answer generation in RAG. In particular, the instruction-tuned LLMs work surprisingly well when a small fraction of ranking data is added to the training blend, and they outperform existing expert ranking models, including the same LLM exclusively fine-tuned on a large amount of ranking data. For generation, we compare our model with many strong baselines, including GPT-4-0613, GPT-4-turbo-2024-0409, and ChatQA-1.5, an open-sourced model with state-of-the-art performance on RAG benchmarks. Specifically, our Llama3-RankRAG significantly outperforms Llama3-ChatQA-1.5 and GPT-4 models on nine knowledge-intensive benchmarks. In addition, it performs comparably to GPT-4 on five RAG benchmarks in the biomedical domain without instruction fine-tuning on biomedical data, demonstrating its superb capability for generalization to new domains.
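
The rank-then-generate idea in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function `rank_then_generate`, the word-overlap scorer `overlap_score`, and the placeholder generator `echo_generate` are all hypothetical stand-ins, and the example contexts are invented for illustration. In the setting the abstract describes, a single instruction-tuned LLM would fill both the scoring and the generation role.

```python
from typing import Callable, List


def rank_then_generate(
    question: str,
    contexts: List[str],
    score: Callable[[str, str], float],
    generate: Callable[[str, List[str]], str],
    k: int = 2,
) -> str:
    """Rerank retrieved contexts with `score`, keep the top k, then generate."""
    ranked = sorted(contexts, key=lambda c: score(question, c), reverse=True)
    return generate(question, ranked[:k])


# Hypothetical stand-in for the ranking role: fraction of question words
# (lowercased, whitespace-split) that also appear in the context.
def overlap_score(question: str, context: str) -> float:
    q = set(question.lower().split())
    c = set(context.lower().split())
    return len(q & c) / max(len(q), 1)


# Hypothetical stand-in for the generation role: it only joins the kept
# contexts so the effect of the ranking step is visible.
def echo_generate(question: str, contexts: List[str]) -> str:
    return " | ".join(contexts)


contexts = [
    "Paris is the capital of France.",
    "Bananas are rich in potassium.",
    "France is a country in Europe.",
]
answer = rank_then_generate(
    "What is the capital of France?", contexts, overlap_score, echo_generate, k=2
)
print(answer)
```

Passing the scorer and generator as arguments keeps the sketch independent of any particular model; both callables could wrap the same underlying LLM, which is the unification the abstract proposes.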

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
Cite as:	arXiv:2407.02485 [cs.CL]
	(or arXiv:2407.02485v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2407.02485

Submission history

From: Wei Ping
[v1] Tue, 2 Jul 2024 17:59:17 UTC (614 KB)
