SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models

Chuang, Yung-Sung; Cohen-Wang, Benjamin; Shen, Shannon Zejiang; Wu, Zhaofeng; Xu, Hu; Lin, Xi Victoria; Glass, James; Li, Shang-Wen; Yih, Wen-tau

Computer Science > Computation and Language

arXiv:2502.09604 (cs)

[Submitted on 13 Feb 2025]

Title:SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models

Authors:Yung-Sung Chuang, Benjamin Cohen-Wang, Shannon Zejiang Shen, Zhaofeng Wu, Hu Xu, Xi Victoria Lin, James Glass, Shang-Wen Li, Wen-tau Yih

View PDF HTML (experimental)

Abstract:We introduce SelfCite, a novel self-supervised approach that aligns LLMs to generate high-quality, fine-grained, sentence-level citations for the statements in their generated responses. Instead of only relying on costly and labor-intensive annotations, SelfCite leverages a reward signal provided by the LLM itself through context ablation: If a citation is necessary, removing the cited text from the context should prevent the same response; if sufficient, retaining the cited text alone should preserve the same response. This reward can guide the inference-time best-of-N sampling strategy to improve citation quality significantly, as well as be used in preference optimization to directly fine-tune the models for generating better citations. The effectiveness of SelfCite is demonstrated by increasing citation F1 up to 5.3 points on the LongBench-Cite benchmark across five long-form question answering tasks.

Comments:	Implementation available at this https URL
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2502.09604 [cs.CL]
	(or arXiv:2502.09604v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2502.09604

Submission history

From: Yung-Sung Chuang [view email]
[v1] Thu, 13 Feb 2025 18:55:13 UTC (959 KB)

Computer Science > Computation and Language

Title:SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators