FrenchToxicityPrompts: a Large Benchmark for Evaluating and Mitigating Toxicity in French Texts

Brun, Caroline; Nikoulina, Vassilina

Computer Science > Computation and Language

arXiv:2406.17566 (cs)

[Submitted on 25 Jun 2024]

Title:FrenchToxicityPrompts: a Large Benchmark for Evaluating and Mitigating Toxicity in French Texts

Authors:Caroline Brun, Vassilina Nikoulina

View PDF HTML (experimental)

Abstract:Large language models (LLMs) are increasingly popular but are also prone to generating bias, toxic or harmful language, which can have detrimental effects on individuals and communities. Although most efforts is put to assess and mitigate toxicity in generated content, it is primarily concentrated on English, while it's essential to consider other languages as well. For addressing this issue, we create and release FrenchToxicityPrompts, a dataset of 50K naturally occurring French prompts and their continuations, annotated with toxicity scores from a widely used toxicity classifier. We evaluate 14 different models from four prevalent open-sourced families of LLMs against our dataset to assess their potential toxicity across various dimensions. We hope that our contribution will foster future research on toxicity detection and mitigation beyond Englis

Comments:	TRAC-2024, Fourth Workshop on Threat, Aggression and Cyberbullying. 20 May 2024
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2406.17566 [cs.CL]
	(or arXiv:2406.17566v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2406.17566

Submission history

From: Caroline Brun [view email]
[v1] Tue, 25 Jun 2024 14:02:11 UTC (112 KB)

Computer Science > Computation and Language

Title:FrenchToxicityPrompts: a Large Benchmark for Evaluating and Mitigating Toxicity in French Texts

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:FrenchToxicityPrompts: a Large Benchmark for Evaluating and Mitigating Toxicity in French Texts

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators