Attention-guided Self-reflection for Zero-shot Hallucination Detection in Large Language Models

Liu, Qiang; Chen, Xinlong; Ding, Yue; Xu, Shizhen; Wu, Shu; Wang, Liang

Computer Science > Computation and Language

arXiv:2501.09997 (cs)

[Submitted on 17 Jan 2025]

Title:Attention-guided Self-reflection for Zero-shot Hallucination Detection in Large Language Models

Authors:Qiang Liu, Xinlong Chen, Yue Ding, Shizhen Xu, Shu Wu, Liang Wang

View PDF HTML (experimental)

Abstract:Hallucination has emerged as a significant barrier to the effective application of Large Language Models (LLMs). In this work, we introduce a novel Attention-Guided SElf-Reflection (AGSER) approach for zero-shot hallucination detection in LLMs. The AGSER method utilizes attention contributions to categorize the input query into attentive and non-attentive queries. Each query is then processed separately through the LLMs, allowing us to compute consistency scores between the generated responses and the original answer. The difference between the two consistency scores serves as a hallucination estimator. In addition to its efficacy in detecting hallucinations, AGSER notably reduces computational complexity, requiring only three passes through the LLM and utilizing two sets of tokens. We have conducted extensive experiments with four widely-used LLMs across three different hallucination benchmarks, demonstrating that our approach significantly outperforms existing methods in zero-shot hallucination detection.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2501.09997 [cs.CL]
	(or arXiv:2501.09997v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2501.09997

Submission history

From: Qiang Liu [view email]
[v1] Fri, 17 Jan 2025 07:30:01 UTC (98 KB)

Computer Science > Computation and Language

Title:Attention-guided Self-reflection for Zero-shot Hallucination Detection in Large Language Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Attention-guided Self-reflection for Zero-shot Hallucination Detection in Large Language Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators