Detecting Offensive Memes with Social Biases in Singapore Context Using Multimodal Large Language Models

Yuxuan, Cao; Jiayang, Wu; Chuen, Alistair Cheong Liang; Guanrong, Bryan Shan; Jen, Theodore Lee Chong; Shen, Sherman Chann Zhi

Computer Science > Computer Vision and Pattern Recognition

arXiv:2502.18101 (cs)

[Submitted on 25 Feb 2025]

Title:Detecting Offensive Memes with Social Biases in Singapore Context Using Multimodal Large Language Models

Authors:Cao Yuxuan, Wu Jiayang, Alistair Cheong Liang Chuen, Bryan Shan Guanrong, Theodore Lee Chong Jen, Sherman Chann Zhi Shen

View PDF HTML (experimental)

Abstract:Traditional online content moderation systems struggle to classify modern multimodal means of communication, such as memes, a highly nuanced and information-dense medium. This task is especially hard in a culturally diverse society like Singapore, where low-resource languages are used and extensive knowledge on local context is needed to interpret online content. We curate a large collection of 112K memes labeled by GPT-4V for fine-tuning a VLM to classify offensive memes in Singapore context. We show the effectiveness of fine-tuned VLMs on our dataset, and propose a pipeline containing OCR, translation and a 7-billion parameter-class VLM. Our solutions reach 80.62% accuracy and 0.8192 AUROC on a held-out test set, and can greatly aid human in moderating online contents. The dataset, code, and model weights will be open-sourced at this https URL.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
Cite as:	arXiv:2502.18101 [cs.CV]
	(or arXiv:2502.18101v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2502.18101

Submission history

From: Bryan Shan [view email]
[v1] Tue, 25 Feb 2025 11:15:49 UTC (1,554 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Detecting Offensive Memes with Social Biases in Singapore Context Using Multimodal Large Language Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Detecting Offensive Memes with Social Biases in Singapore Context Using Multimodal Large Language Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators