Comparative Analysis of Machine Learning and Deep Learning Algorithms for Detection of Online Hate Speech

Dhamija, Tashvik; Anjum; Katarya, Rahul

arXiv:2108.01063 (cs)

[Submitted on 23 Apr 2021]

Abstract: In the age of social media, users have become prone to online hate speech. Several attempts have been made to classify hate speech using machine learning, but the state-of-the-art models are not robust enough for practical applications. This is attributed to the use of primitive NLP feature engineering techniques. In this paper, we explore various feature engineering techniques, ranging from different embeddings to conventional NLP algorithms, and experiment with combinations of different features. We find that RoBERTa (robustly optimized BERT approach) based sentence embeddings classified using decision trees give the best result, an F1 score of 0.9998. We conclude that BERT-based embeddings give the most useful features for this problem and have the capacity to be made into a practical, robust model.
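
The abstract reports its headline result as an F1 score. As a reminder of what that metric measures, the following is a minimal sketch in plain Python. It assumes a binary setting in which label 1 marks hate speech; the abstract does not say whether the reported score is binary, macro-averaged, or weighted, and the labels below are invented for illustration and are not taken from the paper.

```python
def f1_score(y_true, y_pred, positive=1):
    """Return the F1 score (harmonic mean of precision and recall)
    for the class given by `positive`."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        # No true positives: precision and recall are zero or undefined.
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical labels (assumption): 1 = hate speech, 0 = not hate speech.
y_true = [1, 1, 1, 0, 0, 0, 1, 0]
y_pred = [1, 1, 0, 0, 0, 1, 1, 0]
score = f1_score(y_true, y_pred)
print(f"F1 = {score:.4f}")
```

In this toy example there are three true positives, one false positive, and one false negative, so precision and recall are both 0.75 and so is the F1 score. A score near 1.0, such as the one reported in the abstract, requires both false positives and false negatives to be almost absent on the evaluation set.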

Comments:	10 pages
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2108.01063 [cs.CL]
	(or arXiv:2108.01063v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2108.01063
Journal reference:	Advances in Materials Science and Engineering - Select Proceedings of CAMSE 2020, Paper_ID_ 95

Submission history

From: Anjum Anjum
[v1] Fri, 23 Apr 2021 04:19:15 UTC (88 KB)
