SODAPOP: Open-Ended Discovery of Social Biases in Social Commonsense Reasoning Models

An, Haozhe; Li, Zongxia; Zhao, Jieyu; Rudinger, Rachel

arXiv:2210.07269 (cs)

[Submitted on 13 Oct 2022 (v1), last revised 15 Feb 2023 (this version, v2)]

Abstract: A common limitation of diagnostic tests for detecting social biases in NLP models is that they may only detect stereotypic associations that are pre-specified by the designer of the test. Since enumerating all possible problematic associations is infeasible, it is likely these tests fail to detect biases that are present in a model but not pre-specified by the designer. To address this limitation, we propose SODAPOP (SOcial bias Discovery from Answers about PeOPle) in social commonsense question-answering. Our pipeline generates modified instances from the Social IQa dataset (Sap et al., 2019) by (1) substituting names associated with different demographic groups, and (2) generating many distractor answers from a masked language model. By using a social commonsense model to score the generated distractors, we are able to uncover the model's stereotypic associations between demographic groups and an open set of words. We also test SODAPOP on debiased models and show the limitations of multiple state-of-the-art debiasing algorithms.
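The two steps named in the abstract (name substitution and distractor generation), followed by scoring, can be illustrated with a minimal sketch. This is not the authors' implementation. The function `discover_associations`, the name lists, the templates, and the candidate words are all hypothetical. The masked language model is replaced by a fixed candidate word list, and the social commonsense model is replaced by a deliberately biased toy scorer so the example runs without any downloads. Counting a distractor as "successful" when it outscores the correct answer is an assumption made for illustration.

```python
from collections import Counter
from typing import Callable, Dict, List

# A scorer maps (context, question, answer) to a plausibility score.
Scorer = Callable[[str, str, str], float]


def discover_associations(
    context_template: str,
    question_template: str,
    correct_answer: str,
    distractor_template: str,
    candidate_words: List[str],
    names_by_group: Dict[str, List[str]],
    scorer: Scorer,
    mask: str = "[MASK]",
) -> Dict[str, Counter]:
    """Count, per group, which filled-in words beat the correct answer."""
    successes: Dict[str, Counter] = {group: Counter() for group in names_by_group}
    for group, names in names_by_group.items():
        for name in names:
            # Step 1: substitute a name associated with the group.
            context = context_template.format(name=name)
            question = question_template.format(name=name)
            correct_score = scorer(context, question, correct_answer)
            # Step 2: build distractors (stand-in for masked LM sampling).
            for word in candidate_words:
                distractor = distractor_template.replace(mask, word)
                # Step 3: a distractor "succeeds" if it outscores the answer.
                if scorer(context, question, distractor) > correct_score:
                    successes[group][word] += 1
    return successes


def toy_scorer(context: str, question: str, answer: str) -> float:
    """Deliberately biased toy model, used only to make the sketch runnable."""
    if answer == "a generous person":
        return 0.6
    if "Alice" in context and "quiet" in answer:
        return 0.9
    if "Bob" in context and "loud" in answer:
        return 0.9
    return 0.1


result = discover_associations(
    context_template="{name} lent a friend some money.",
    question_template="How would you describe {name}?",
    correct_answer="a generous person",
    distractor_template="a [MASK] person",
    candidate_words=["quiet", "loud", "tall"],
    names_by_group={"group_a": ["Alice"], "group_b": ["Bob"]},
    scorer=toy_scorer,
)
for group, counts in result.items():
    print(group, dict(counts))
```

Because the word list is open-ended rather than fixed in advance by the test designer, any word whose success count differs sharply between groups becomes a candidate association to inspect. In a real setting the candidate words would come from a masked language model and the scorer would be the model under study.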

Comments:	EACL 2023
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2210.07269 [cs.CL]
	(or arXiv:2210.07269v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2210.07269

Submission history

From: Haozhe An
[v1] Thu, 13 Oct 2022 18:04:48 UTC (8,017 KB)
[v2] Wed, 15 Feb 2023 21:13:26 UTC (10,751 KB)
