SCOPE: A Self-supervised Framework for Improving Faithfulness in Conditional Text Generation

Duong, Song; Bronnec, Florian Le; Allauzen, Alexandre; Guigue, Vincent; Lumbreras, Alberto; Soulier, Laure; Gallinari, Patrick

Computer Science > Computation and Language

arXiv:2502.13674 (cs)

[Submitted on 19 Feb 2025]

Title:SCOPE: A Self-supervised Framework for Improving Faithfulness in Conditional Text Generation

Authors:Song Duong, Florian Le Bronnec, Alexandre Allauzen, Vincent Guigue, Alberto Lumbreras, Laure Soulier, Patrick Gallinari

View PDF HTML (experimental)

Abstract:Large Language Models (LLMs), when used for conditional text generation, often produce hallucinations, i.e., information that is unfaithful or not grounded in the input context. This issue arises in typical conditional text generation tasks, such as text summarization and data-to-text generation, where the goal is to produce fluent text based on contextual input. When fine-tuned on specific domains, LLMs struggle to provide faithful answers to a given context, often adding information or generating errors. One underlying cause of this issue is that LLMs rely on statistical patterns learned from their training data. This reliance can interfere with the model's ability to stay faithful to a provided context, leading to the generation of ungrounded information. We build upon this observation and introduce a novel self-supervised method for generating a training set of unfaithful samples. We then refine the model using a training process that encourages the generation of grounded outputs over unfaithful ones, drawing on preference-based training. Our approach leads to significantly more grounded text generation, outperforming existing self-supervised techniques in faithfulness, as evaluated through automatic metrics, LLM-based assessments, and human evaluations.

Comments:	10 pages, ICLR 2025 conference
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2502.13674 [cs.CL]
	(or arXiv:2502.13674v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2502.13674

Submission history

From: Florian Le Bronnec [view email]
[v1] Wed, 19 Feb 2025 12:31:58 UTC (1,981 KB)

Computer Science > Computation and Language

Title:SCOPE: A Self-supervised Framework for Improving Faithfulness in Conditional Text Generation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:SCOPE: A Self-supervised Framework for Improving Faithfulness in Conditional Text Generation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators