Zero-shot Code-Mixed Offensive Span Identification through Rationale Extraction

Ravikiran, Manikandan; Chakravarthi, Bharathi Raja

Computer Science > Computation and Language

arXiv:2205.06119 (cs)

[Submitted on 12 May 2022]

Title:Zero-shot Code-Mixed Offensive Span Identification through Rationale Extraction

Authors:Manikandan Ravikiran, Bharathi Raja Chakravarthi

View PDF

Abstract:This paper investigates the effectiveness of sentence-level transformers for zero-shot offensive span identification on a code-mixed Tamil dataset. More specifically, we evaluate rationale extraction methods of Local Interpretable Model Agnostic Explanations (LIME) \cite{DBLP:conf/kdd/Ribeiro0G16} and Integrated Gradients (IG) \cite{DBLP:conf/icml/SundararajanTY17} for adapting transformer based offensive language classification models for zero-shot offensive span identification. To this end, we find that LIME and IG show baseline $F_{1}$ of 26.35\% and 44.83\%, respectively. Besides, we study the effect of data set size and training process on the overall accuracy of span identification. As a result, we find both LIME and IG to show significant improvement with Masked Data Augmentation and Multilabel Training, with $F_{1}$ of 50.23\% and 47.38\% respectively. \textit{Disclaimer : This paper contains examples that may be considered profane, vulgar, or offensive. The examples do not represent the views of the authors or their employers/graduate schools towards any person(s), group(s), practice(s), or entity/entities. Instead they are used to emphasize only the linguistic research challenges.}

Comments:	Submission to this https URL
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2205.06119 [cs.CL]
	(or arXiv:2205.06119v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2205.06119

Submission history

From: Manikandan Ravikiran [view email]
[v1] Thu, 12 May 2022 14:32:12 UTC (55 KB)

Computer Science > Computation and Language

Title:Zero-shot Code-Mixed Offensive Span Identification through Rationale Extraction

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Zero-shot Code-Mixed Offensive Span Identification through Rationale Extraction

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators