Chinese Offensive Language Detection: Current Status and Future Directions

Xiao, Yunze; Bouamor, Houda; Zaghouani, Wajdi

arXiv:2403.18314 (cs)

[Submitted on 27 Mar 2024 (v1), last revised 29 Mar 2024 (this version, v3)]

Abstract: Despite the considerable efforts being made to monitor and regulate user-generated content on social media platforms, the pervasiveness of offensive language, such as hate speech or cyberbullying, in the digital space remains a significant challenge. Given the importance of maintaining a civilized and respectful online environment, there is an urgent and growing need for automatic systems capable of detecting offensive speech in real time. However, developing effective systems for processing languages such as Chinese presents a significant challenge, owing to the language's complex and nuanced nature, which makes it difficult to process automatically. This paper provides a comprehensive overview of offensive language detection in Chinese, examining current benchmarks and approaches and highlighting specific models and tools for addressing the unique challenges of detecting offensive language in this complex language. The primary objective of this survey is to explore the existing techniques and identify potential avenues for further research that can address the cultural and linguistic complexities of Chinese.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2403.18314 [cs.CL]
	(or arXiv:2403.18314v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2403.18314

Submission history

From: Yunze Xiao
[v1] Wed, 27 Mar 2024 07:34:44 UTC (8,446 KB)
[v2] Thu, 28 Mar 2024 05:27:43 UTC (8,834 KB)
[v3] Fri, 29 Mar 2024 18:48:35 UTC (8,834 KB)
