The Impact of Persona-based Political Perspectives on Hateful Content Detection

Civelli, Stefano; Bernardelle, Pietro; Demartini, Gianluca

doi:10.1145/3701716.3718383

Computer Science > Computation and Language

arXiv:2502.00385 (cs)

[Submitted on 1 Feb 2025 (v1), last revised 26 Feb 2025 (this version, v2)]

Title:The Impact of Persona-based Political Perspectives on Hateful Content Detection

Authors:Stefano Civelli, Pietro Bernardelle, Gianluca Demartini

View PDF HTML (experimental)

Abstract:While pretraining language models with politically diverse content has been shown to improve downstream task fairness, such approaches require significant computational resources often inaccessible to many researchers and organizations. Recent work has established that persona-based prompting can introduce political diversity in model outputs without additional training. However, it remains unclear whether such prompting strategies can achieve results comparable to political pretraining for downstream tasks. We investigate this question using persona-based prompting strategies in multimodal hate-speech detection tasks, specifically focusing on hate speech in memes. Our analysis reveals that when mapping personas onto a political compass and measuring persona agreement, inherent political positioning has surprisingly little correlation with classification decisions. Notably, this lack of correlation persists even when personas are explicitly injected with stronger ideological descriptors. Our findings suggest that while LLMs can exhibit political biases in their responses to direct political questions, these biases may have less impact on practical classification tasks than previously assumed. This raises important questions about the necessity of computationally expensive political pretraining for achieving fair performance in downstream tasks.

Comments:	Companion Proceedings of the ACM Web Conference 2025 (WWW Companion'25)
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2502.00385 [cs.CL]
	(or arXiv:2502.00385v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2502.00385
Related DOI:	https://doi.org/10.1145/3701716.3718383

Submission history

From: Stefano Civelli [view email]
[v1] Sat, 1 Feb 2025 09:53:17 UTC (1,483 KB)
[v2] Wed, 26 Feb 2025 03:23:41 UTC (1,508 KB)

Computer Science > Computation and Language

Title:The Impact of Persona-based Political Perspectives on Hateful Content Detection

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:The Impact of Persona-based Political Perspectives on Hateful Content Detection

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators