Smoothing Out Hallucinations: Mitigating LLM Hallucination with Smoothed Knowledge Distillation

Nguyen, Hieu; He, Zihao; Gandre, Shoumik Atul; Pasupulety, Ujjwal; Shivakumar, Sharanya Kumari; Lerman, Kristina

Computer Science > Computation and Language

arXiv:2502.11306 (cs)

[Submitted on 16 Feb 2025]

Title:Smoothing Out Hallucinations: Mitigating LLM Hallucination with Smoothed Knowledge Distillation

Authors:Hieu Nguyen, Zihao He, Shoumik Atul Gandre, Ujjwal Pasupulety, Sharanya Kumari Shivakumar, Kristina Lerman

View PDF HTML (experimental)

Abstract:Large language models (LLMs) often suffer from hallucination, generating factually incorrect or ungrounded content, which limits their reliability in high-stakes applications. A key factor contributing to hallucination is the use of hard labels during training, which enforce deterministic supervision, encourage overconfidence, and disregard the uncertainty inherent in natural language. To address this, we propose mitigating hallucination through knowledge distillation (KD), where a teacher model provides smoothed soft labels to a student model, reducing overconfidence and improving factual grounding. We apply KD during supervised finetuning on instructional data, evaluating its effectiveness across LLMs from different families. Experimental results on summarization benchmarks demonstrate that KD reduces hallucination compared to standard finetuning while preserving performance on general NLP tasks. These findings highlight KD as a promising approach for mitigating hallucination in LLMs and improving model reliability.

Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2502.11306 [cs.CL]
	(or arXiv:2502.11306v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2502.11306

Submission history

From: Zihao He [view email]
[v1] Sun, 16 Feb 2025 23:05:36 UTC (339 KB)

Computer Science > Computation and Language

Title:Smoothing Out Hallucinations: Mitigating LLM Hallucination with Smoothed Knowledge Distillation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Smoothing Out Hallucinations: Mitigating LLM Hallucination with Smoothed Knowledge Distillation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators