CORD: Balancing COnsistency and Rank Distillation for Robust Retrieval-Augmented Generation

Lee, Youngwon; Hwang, Seung-won; Campos, Daniel; Graliński, Filip; Yao, Zhewei; He, Yuxiong

arXiv:2412.14581 (cs)

[Submitted on 19 Dec 2024]

Abstract: With the adoption of retrieval-augmented generation (RAG), large language models (LLMs) are expected to ground their generation in the retrieved contexts. Yet this is hindered by the position bias of LLMs, which fail to attend evenly to all contexts. Previous work has addressed this by synthesizing contexts with perturbed positions of the gold segment, creating a position-diversified training set. We extend this intuition to propose consistency regularization with augmentation and distillation. First, we augment each training instance with its position perturbation to encourage consistent predictions regardless of ordering. We also distill behaviors of this pair, although this can be counterproductive in certain RAG scenarios where the order given by the retriever is crucial for generation quality. We thus propose CORD, balancing COnsistency and Rank Distillation. CORD adaptively samples noise-controlled perturbations from an interpolation space, ensuring both consistency and respect for the rank prior. Empirical results show that this balance enables CORD to consistently outperform baselines across diverse RAG benchmarks.
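
The idea of pairing each training instance with a noise-controlled reordering, then penalizing disagreement between the two predictions, can be illustrated with a small sketch. The abstract does not specify the interpolation space or the loss, so everything here is an assumption made for illustration: the Gaussian rank jitter, the symmetric KL term, and the names `perturb_order` and `symmetric_kl` are hypothetical and are not taken from the paper.

```python
import math
import random


def perturb_order(passages, noise, rng):
    """Reorder `passages` with a deviation that grows with `noise`.

    Assumption for illustration: noise = 0 keeps the retriever's ranking
    exactly, while larger values drift toward a uniform shuffle.
    """
    n = len(passages)
    # Jitter each rank index, then sort by the jittered keys.
    keys = [i + rng.gauss(0.0, noise * n) for i in range(n)]
    order = sorted(range(n), key=lambda i: keys[i])
    return [passages[i] for i in order]


def symmetric_kl(p, q, eps=1e-12):
    """Symmetric KL divergence between two discrete distributions.

    Stands in for a consistency term between the predictions made on the
    original ordering and on the perturbed ordering.
    """
    kl_pq = sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q))
    kl_qp = sum(qi * math.log((qi + eps) / (pi + eps)) for pi, qi in zip(p, q))
    return 0.5 * (kl_pq + kl_qp)


if __name__ == "__main__":
    rng = random.Random(0)
    retrieved = ["doc_a", "doc_b", "doc_c", "doc_d"]
    print(perturb_order(retrieved, 0.0, rng))  # rank prior fully respected
    print(perturb_order(retrieved, 1.0, rng))  # heavily perturbed ordering
    # Toy next-token distributions from the two orderings (made-up numbers).
    print(symmetric_kl([0.7, 0.2, 0.1], [0.5, 0.3, 0.2]))
```

In this sketch the single `noise` parameter plays the role of the balance described in the abstract: small values preserve the retriever's order where it matters, and large values push toward order-invariant predictions.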

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2412.14581 [cs.CL]
	(or arXiv:2412.14581v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2412.14581

Submission history

From: Youngwon Lee
[v1] Thu, 19 Dec 2024 07:01:25 UTC (354 KB)
