Context-DPO: Aligning Language Models for Context-Faithfulness

Bi, Baolong; Huang, Shaohan; Wang, Yiwei; Yang, Tianchi; Zhang, Zihan; Huang, Haizhen; Mei, Lingrui; Fang, Junfeng; Li, Zehao; Wei, Furu; Deng, Weiwei; Sun, Feng; Zhang, Qi; Liu, Shenghua

arXiv:2412.15280 (cs)

[Submitted on 18 Dec 2024]

Abstract: Reliable responses from large language models (LLMs) require adherence to user instructions and retrieved information. While alignment techniques help LLMs align with human intentions and values, improving context-faithfulness through alignment remains underexplored. To address this, we propose $\textbf{Context-DPO}$, the first alignment method specifically designed to enhance LLMs' context-faithfulness. We introduce $\textbf{ConFiQA}$, a benchmark that simulates Retrieval-Augmented Generation (RAG) scenarios with knowledge conflicts to evaluate context-faithfulness. By leveraging faithful and stubborn responses to questions with provided context from ConFiQA, our Context-DPO aligns LLMs through direct preference optimization. Extensive experiments demonstrate that our Context-DPO significantly improves context-faithfulness, achieving 35% to 280% improvements on popular open-source models. Further analysis demonstrates that Context-DPO preserves LLMs' generative capabilities while providing interpretable insights into context utilization. Our code and data are released at this https URL
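
To make the preference setup in the abstract concrete, here is a minimal sketch of the standard direct preference optimization loss for a single pair, with the context-faithful response treated as the preferred one and the stubborn response as the rejected one. The abstract does not state the exact objective, the value of beta, or how log-probabilities are computed, so the function name `dpo_pair_loss`, the default `beta=0.1`, and the sequence-level log-probability inputs are all assumptions made for illustration, not the authors' implementation.

```python
import math


def dpo_pair_loss(policy_faithful_logp, policy_stubborn_logp,
                  ref_faithful_logp, ref_stubborn_logp, beta=0.1):
    """Standard DPO loss for one (faithful, stubborn) response pair.

    All inputs are assumed to be sequence-level log-probabilities of a
    response given the question and its context. `beta` is an assumed
    hyperparameter; the abstract does not report it.
    """
    # How much more the policy likes each response than the reference does.
    faithful_gain = policy_faithful_logp - ref_faithful_logp
    stubborn_gain = policy_stubborn_logp - ref_stubborn_logp
    margin = beta * (faithful_gain - stubborn_gain)

    # Loss is -log(sigmoid(margin)), written in a numerically stable form.
    if margin >= 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))


# A policy that shifts probability toward the faithful response gets a
# lower loss than one that shifts toward the stubborn response.
prefers_faithful = dpo_pair_loss(-2.0, -8.0, -5.0, -5.0)
prefers_stubborn = dpo_pair_loss(-8.0, -2.0, -5.0, -5.0)
```

When the policy matches the reference on both responses, the margin is zero and the loss equals log 2; training lowers the loss by raising the faithful response's likelihood relative to the stubborn one.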

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
Cite as:	arXiv:2412.15280 [cs.CL]
	(or arXiv:2412.15280v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2412.15280

Submission history

From: Baolong Bi
[v1] Wed, 18 Dec 2024 04:08:18 UTC (1,385 KB)
