The Law of Knowledge Overshadowing: Towards Understanding, Predicting, and Preventing LLM Hallucination

Zhang, Yuji; Li, Sha; Qian, Cheng; Liu, Jiateng; Yu, Pengfei; Han, Chi; Fung, Yi R.; McKeown, Kathleen; Zhai, Chengxiang; Li, Manling; Ji, Heng

Computer Science > Computation and Language

arXiv:2502.16143 (cs)

[Submitted on 22 Feb 2025]

Title:The Law of Knowledge Overshadowing: Towards Understanding, Predicting, and Preventing LLM Hallucination

Authors:Yuji Zhang, Sha Li, Cheng Qian, Jiateng Liu, Pengfei Yu, Chi Han, Yi R. Fung, Kathleen McKeown, Chengxiang Zhai, Manling Li, Heng Ji

View PDF HTML (experimental)

Abstract:Hallucination is a persistent challenge in large language models (LLMs), where even with rigorous quality control, models often generate distorted facts. This paradox, in which error generation continues despite high-quality training data, calls for a deeper understanding of the underlying LLM mechanisms. To address it, we propose a novel concept: knowledge overshadowing, where model's dominant knowledge can obscure less prominent knowledge during text generation, causing the model to fabricate inaccurate details. Building on this idea, we introduce a novel framework to quantify factual hallucinations by modeling knowledge overshadowing. Central to our approach is the log-linear law, which predicts that the rate of factual hallucination increases linearly with the logarithmic scale of (1) Knowledge Popularity, (2) Knowledge Length, and (3) Model Size. The law provides a means to preemptively quantify hallucinations, offering foresight into their occurrence even before model training or inference. Built on overshadowing effect, we propose a new decoding strategy CoDa, to mitigate hallucinations, which notably enhance model factuality on Overshadow (27.9%), MemoTrap (13.1%) and NQ-Swap (18.3%). Our findings not only deepen understandings of the underlying mechanisms behind hallucinations but also provide actionable insights for developing more predictable and controllable language models.

Comments:	19 pages, 5 figures
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2502.16143 [cs.CL]
	(or arXiv:2502.16143v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2502.16143

Submission history

From: Yuji Zhang [view email]
[v1] Sat, 22 Feb 2025 08:36:06 UTC (7,771 KB)

Computer Science > Computation and Language

Title:The Law of Knowledge Overshadowing: Towards Understanding, Predicting, and Preventing LLM Hallucination

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:The Law of Knowledge Overshadowing: Towards Understanding, Predicting, and Preventing LLM Hallucination

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators