HalCECE: A Framework for Explainable Hallucination Detection through Conceptual Counterfactuals in Image Captioning

Lymperaiou, Maria; FIlandrianos, Giorgos; Dimitriou, Angeliki; Voulodimos, Athanasios; Stamou, Giorgos

Computer Science > Computer Vision and Pattern Recognition

arXiv:2503.00436 (cs)

[Submitted on 1 Mar 2025]

Title:HalCECE: A Framework for Explainable Hallucination Detection through Conceptual Counterfactuals in Image Captioning

Authors:Maria Lymperaiou, Giorgos FIlandrianos, Angeliki Dimitriou, Athanasios Voulodimos, Giorgos Stamou

View PDF HTML (experimental)

Abstract:In the dynamic landscape of artificial intelligence, the exploration of hallucinations within vision-language (VL) models emerges as a critical frontier. This work delves into the intricacies of hallucinatory phenomena exhibited by widely used image captioners, unraveling interesting patterns. Specifically, we step upon previously introduced techniques of conceptual counterfactual explanations to address VL hallucinations. The deterministic and efficient nature of the employed conceptual counterfactuals backbone is able to suggest semantically minimal edits driven by hierarchical knowledge, so that the transition from a hallucinated caption to a non-hallucinated one is performed in a black-box manner. HalCECE, our proposed hallucination detection framework is highly interpretable, by providing semantically meaningful edits apart from standalone numbers, while the hierarchical decomposition of hallucinated concepts leads to a thorough hallucination analysis. Another novelty tied to the current work is the investigation of role hallucinations, being one of the first works to involve interconnections between visual concepts in hallucination detection. Overall, HalCECE recommends an explainable direction to the crucial field of VL hallucination detection, thus fostering trustworthy evaluation of current and future VL systems.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2503.00436 [cs.CV]
	(or arXiv:2503.00436v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2503.00436

Submission history

From: Maria Lymperaiou [view email]
[v1] Sat, 1 Mar 2025 10:28:19 UTC (1,282 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:HalCECE: A Framework for Explainable Hallucination Detection through Conceptual Counterfactuals in Image Captioning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:HalCECE: A Framework for Explainable Hallucination Detection through Conceptual Counterfactuals in Image Captioning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators