Multilingual Auxiliary Tasks Training: Bridging the Gap between Languages for Zero-Shot Transfer of Hate Speech Detection Models

Montariol, Syrielle; Riabi, Arij; Seddah, Djamé

Computer Science > Computation and Language

arXiv:2210.13029 (cs)

[Submitted on 24 Oct 2022 (v1), last revised 25 Oct 2022 (this version, v2)]

Title:Multilingual Auxiliary Tasks Training: Bridging the Gap between Languages for Zero-Shot Transfer of Hate Speech Detection Models

Authors:Syrielle Montariol, Arij Riabi, Djamé Seddah

View PDF

Abstract:Zero-shot cross-lingual transfer learning has been shown to be highly challenging for tasks involving a lot of linguistic specificities or when a cultural gap is present between languages, such as in hate speech detection. In this paper, we highlight this limitation for hate speech detection in several domains and languages using strict experimental settings. Then, we propose to train on multilingual auxiliary tasks -- sentiment analysis, named entity recognition, and tasks relying on syntactic information -- to improve zero-shot transfer of hate speech detection models across languages. We show how hate speech detection models benefit from a cross-lingual knowledge proxy brought by auxiliary tasks fine-tuning and highlight these tasks' positive impact on bridging the hate speech linguistic and cultural gap between languages.

Comments:	Accepted to Findings of AACL-IJCNLP 2022
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2210.13029 [cs.CL]
	(or arXiv:2210.13029v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2210.13029

Submission history

From: Arij Riabi [view email]
[v1] Mon, 24 Oct 2022 08:26:51 UTC (6,145 KB)
[v2] Tue, 25 Oct 2022 08:20:35 UTC (6,145 KB)

Computer Science > Computation and Language

Title:Multilingual Auxiliary Tasks Training: Bridging the Gap between Languages for Zero-Shot Transfer of Hate Speech Detection Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Multilingual Auxiliary Tasks Training: Bridging the Gap between Languages for Zero-Shot Transfer of Hate Speech Detection Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators