Multilingual Fine-Grained News Headline Hallucination Detection

Shen, Jiaming; Liu, Tianqi; Liu, Jialu; Qin, Zhen; Pavagadhi, Jay; Baumgartner, Simon; Bendersky, Michael

Computer Science > Computation and Language

arXiv:2407.15975 (cs)

[Submitted on 22 Jul 2024]

Title:Multilingual Fine-Grained News Headline Hallucination Detection

Authors:Jiaming Shen, Tianqi Liu, Jialu Liu, Zhen Qin, Jay Pavagadhi, Simon Baumgartner, Michael Bendersky

View PDF HTML (experimental)

Abstract:The popularity of automated news headline generation has surged with advancements in pre-trained language models. However, these models often suffer from the ``hallucination'' problem, where the generated headline is not fully supported by its source article. Efforts to address this issue have predominantly focused on English, using over-simplistic classification schemes that overlook nuanced hallucination types. In this study, we introduce the first multilingual, fine-grained news headline hallucination detection dataset that contains over 11 thousand pairs in 5 languages, each annotated with detailed hallucination types by experts. We conduct extensive experiments on this dataset under two settings. First, we implement several supervised fine-tuning approaches as preparatory solutions and demonstrate this dataset's challenges and utilities. Second, we test various large language models' in-context learning abilities and propose two novel techniques, language-dependent demonstration selection and coarse-to-fine prompting, to boost the few-shot hallucination detection performance in terms of the example-F1 metric. We release this dataset to foster further research in multilingual, fine-grained headline hallucination detection.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2407.15975 [cs.CL]
	(or arXiv:2407.15975v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2407.15975

Submission history

From: Jiaming Shen [view email]
[v1] Mon, 22 Jul 2024 18:37:53 UTC (4,214 KB)

Computer Science > Computation and Language

Title:Multilingual Fine-Grained News Headline Hallucination Detection

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Multilingual Fine-Grained News Headline Hallucination Detection

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators