Self-Memory Alignment: Mitigating Factual Hallucinations with Generalized Improvement

Zhang, Siyuan; Zhang, Yichi; Dong, Yinpeng; Su, Hang

arXiv:2502.19127 (cs)

[Submitted on 26 Feb 2025]

Abstract: Large Language Models (LLMs) often struggle to align their responses with objective facts, resulting in factual hallucinations, which can be difficult to detect and can mislead users who lack the relevant knowledge. While post-training techniques have been employed to mitigate the issue, existing methods usually suffer from poor generalization and trade-offs among different capabilities. In this paper, we propose to address it by directly augmenting an LLM's fundamental ability to precisely leverage its existing memory, that is, the knowledge acquired from pre-training data. We introduce self-memory alignment (SMA), which fine-tunes the model on self-generated responses to precise and simple factual questions through preference optimization. Furthermore, we construct FactualBench, a comprehensive and precise factual QA dataset containing 181k Chinese entries spanning 21 domains, to facilitate both evaluation and training. Extensive experiments show that SMA significantly improves LLMs' overall performance, with consistent enhancement across various benchmarks concerning factuality, as well as helpfulness and comprehensive skills.
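
Illustrative sketch (not taken from the paper): the abstract describes fine-tuning on self-generated responses to factual questions through preference optimization, but does not state how pairs are built or which objective is used. The Python below shows one plausible reading under loudly stated assumptions: responses sampled from the model are split into correct and incorrect by a hypothetical substring check (`is_correct`), every correct response is paired with every incorrect one, and each pair is scored with the standard DPO loss as one common preference-optimization objective. The function names, the matching rule, and `beta=0.1` are all assumptions made for illustration.

```python
import math
from itertools import product


def is_correct(response: str, reference: str) -> bool:
    """Hypothetical correctness check: case-insensitive substring match."""
    return reference.strip().lower() in response.lower()


def build_preference_pairs(question, reference, sampled_responses):
    """Pair every correct self-generated response with every incorrect one."""
    correct = [r for r in sampled_responses if is_correct(r, reference)]
    wrong = [r for r in sampled_responses if not is_correct(r, reference)]
    return [
        {"prompt": question, "chosen": c, "rejected": w}
        for c, w in product(correct, wrong)
    ]


def dpo_loss(logp_chosen, logp_rejected, ref_logp_chosen, ref_logp_rejected, beta=0.1):
    """DPO loss for one pair, given summed token log-probabilities.

    Assumption: DPO stands in for the unspecified preference objective.
    """
    margin = beta * (
        (logp_chosen - ref_logp_chosen) - (logp_rejected - ref_logp_rejected)
    )
    # -log(sigmoid(margin)) rewritten as log(1 + exp(-margin))
    return math.log1p(math.exp(-margin))


if __name__ == "__main__":
    samples = ["The capital of France is Paris.", "It is Lyon.", "Paris."]
    for pair in build_preference_pairs("What is the capital of France?", "Paris", samples):
        print(pair)
```

A question for which all sampled responses are correct, or all incorrect, yields no pairs in this sketch, so it contributes no training signal; how the paper handles such cases is not stated in the abstract.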

Comments:	29 pages, 17 figures
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2502.19127 [cs.CL]
	(or arXiv:2502.19127v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2502.19127

Submission history

From: Siyuan Zhang
[v1] Wed, 26 Feb 2025 13:34:52 UTC (486 KB)
