CONCRETE: Improving Cross-lingual Fact-checking with Cross-lingual Retrieval

Huang, Kung-Hsiang; Zhai, ChengXiang; Ji, Heng

Abstract:Fact-checking has gained increasing attention due to the widespread of falsified information. Most fact-checking approaches focus on claims made in English only due to the data scarcity issue in other languages. The lack of fact-checking datasets in low-resource languages calls for an effective cross-lingual transfer technique for fact-checking. Additionally, trustworthy information in different languages can be complementary and helpful in verifying facts. To this end, we present the first fact-checking framework augmented with cross-lingual retrieval that aggregates evidence retrieved from multiple languages through a cross-lingual retriever. Given the absence of cross-lingual information retrieval datasets with claim-like queries, we train the retriever with our proposed Cross-lingual Inverse Cloze Task (X-ICT), a self-supervised algorithm that creates training instances by translating the title of a passage. The goal for X-ICT is to learn cross-lingual retrieval in which the model learns to identify the passage corresponding to a given translated title. On the X-Fact dataset, our approach achieves 2.23% absolute F1 improvement in the zero-shot cross-lingual setup over prior systems. The source code and data are publicly available at this https URL.

Comments:	Accepted by COLING 2022
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2209.02071 [cs.CL]
	(or arXiv:2209.02071v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2209.02071

Computer Science > Computation and Language

Title:CONCRETE: Improving Cross-lingual Fact-checking with Cross-lingual Retrieval

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators