Bias Against 93 Stigmatized Groups in Masked Language Models and Downstream Sentiment Classification Tasks

Mei, Katelyn X.; Fereidooni, Sonia; Caliskan, Aylin

doi:10.1145/3593013.3594109 10.1145/3593013.3594109 10.1145/3593013.3594109

Computer Science > Computers and Society

arXiv:2306.05550 (cs)

[Submitted on 8 Jun 2023]

Title:Bias Against 93 Stigmatized Groups in Masked Language Models and Downstream Sentiment Classification Tasks

Authors:Katelyn X. Mei, Sonia Fereidooni, Aylin Caliskan

View PDF

Abstract:The rapid deployment of artificial intelligence (AI) models demands a thorough investigation of biases and risks inherent in these models to understand their impact on individuals and society. This study extends the focus of bias evaluation in extant work by examining bias against social stigmas on a large scale. It focuses on 93 stigmatized groups in the United States, including a wide range of conditions related to disease, disability, drug use, mental illness, religion, sexuality, socioeconomic status, and other relevant factors. We investigate bias against these groups in English pre-trained Masked Language Models (MLMs) and their downstream sentiment classification tasks. To evaluate the presence of bias against 93 stigmatized conditions, we identify 29 non-stigmatized conditions to conduct a comparative analysis. Building upon a psychology scale of social rejection, the Social Distance Scale, we prompt six MLMs: RoBERTa-base, RoBERTa-large, XLNet-large, BERTweet-base, BERTweet-large, and DistilBERT. We use human annotations to analyze the predicted words from these models, with which we measure the extent of bias against stigmatized groups. When prompts include stigmatized conditions, the probability of MLMs predicting negative words is approximately 20 percent higher than when prompts have non-stigmatized conditions. In the sentiment classification tasks, when sentences include stigmatized conditions related to diseases, disability, education, and mental illness, they are more likely to be classified as negative. We also observe a strong correlation between bias in MLMs and their downstream sentiment classifiers (r =0.79). The evidence indicates that MLMs and their downstream sentiment classification tasks exhibit biases against socially stigmatized groups.

Comments:	20 pages,12 figures,2 tables; ACM FAccT 2023
Subjects:	Computers and Society (cs.CY); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
ACM classes:	K.4; I.2.7; I.2.0
Cite as:	arXiv:2306.05550 [cs.CY]
	(or arXiv:2306.05550v1 [cs.CY] for this version)
	https://doi.org/10.48550/arXiv.2306.05550
Related DOI:	https://doi.org/10.1145/3593013.3594109 https://doi.org/10.1145/3593013.3594109 https://doi.org/10.1145/3593013.3594109

Submission history

From: Katelyn X. Mei [view email]
[v1] Thu, 8 Jun 2023 20:46:09 UTC (1,210 KB)

Computer Science > Computers and Society

Title:Bias Against 93 Stigmatized Groups in Masked Language Models and Downstream Sentiment Classification Tasks

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computers and Society

Title:Bias Against 93 Stigmatized Groups in Masked Language Models and Downstream Sentiment Classification Tasks

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators