On Measures of Biases and Harms in NLP

Dev, Sunipa; Sheng, Emily; Zhao, Jieyu; Amstutz, Aubrie; Sun, Jiao; Hou, Yu; Sanseverino, Mattie; Kim, Jiin; Nishi, Akihiro; Peng, Nanyun; Chang, Kai-Wei

arXiv:2108.03362 (cs)

[Submitted on 7 Aug 2021 (v1), last revised 13 Oct 2022 (this version, v2)]

Abstract: Recent studies show that Natural Language Processing (NLP) technologies propagate societal biases about demographic groups associated with attributes such as gender, race, and nationality. To create interventions and mitigate these biases and associated harms, it is vital to be able to detect and measure such biases. While existing works propose bias evaluation and mitigation methods for various tasks, there remains a need to cohesively understand the biases and the specific harms they measure, and how different measures compare with each other. To address this gap, this work presents a practical framework of harms and a series of questions that practitioners can answer to guide the development of bias measures. As a validation of our framework and documentation questions, we also present several case studies of how existing bias measures in NLP -- both intrinsic measures of bias in representations and extrinsic measures of bias in downstream applications -- can be aligned with different harms, and how our proposed documentation questions facilitate a more holistic understanding of what bias measures are measuring.

Subjects:	Computation and Language (cs.CL); Computers and Society (cs.CY)
Cite as:	arXiv:2108.03362 [cs.CL]
	(or arXiv:2108.03362v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2108.03362

Submission history

From: Sunipa Dev
[v1] Sat, 7 Aug 2021 04:08:47 UTC (5,003 KB)
[v2] Thu, 13 Oct 2022 22:38:20 UTC (5,035 KB)
