Exploring Gender Bias in Retrieval Models

Sundararaman, Dhanasekar; Subramanian, Vivek

Computer Science > Computation and Language

arXiv:2208.01755v1 (cs)

[Submitted on 2 Aug 2022 (this version), latest version 20 Sep 2022 (v3)]

Title:Exploring Gender Bias in Retrieval Models

Authors:Dhanasekar Sundararaman, Vivek Subramanian

View PDF

Abstract:Biases in culture, gender, ethnicity, etc. have existed for decades and have affected many areas of human social interaction. These biases have been shown to impact machine learning (ML) models, and for natural language processing (NLP), this can have severe consequences for downstream tasks. Mitigating gender bias in information retrieval (IR) is important to avoid propagating stereotypes. In this work, we employ a dataset consisting of two components: (1) relevance of a document to a query and (2) "gender" of a document, in which pronouns are replaced by male, female, and neutral conjugations. We definitively show that pre-trained models for IR do not perform well in zero-shot retrieval tasks when full fine-tuning of a large pre-trained BERT encoder is performed and that lightweight fine-tuning performed with adapter networks improves zero-shot retrieval performance almost by 20% over baseline. We also illustrate that pre-trained models have gender biases that result in retrieved articles tending to be more often male than female. We overcome this by introducing a debiasing technique that penalizes the model when it prefers males over females, resulting in an effective model that retrieves articles in a balanced fashion across genders.

Subjects:	Computation and Language (cs.CL); Information Retrieval (cs.IR)
Cite as:	arXiv:2208.01755 [cs.CL]
	(or arXiv:2208.01755v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2208.01755

Submission history

From: Dhanasekar Sundararaman [view email]
[v1] Tue, 2 Aug 2022 21:12:05 UTC (1,958 KB)
[v2] Sat, 6 Aug 2022 05:31:53 UTC (1,454 KB)
[v3] Tue, 20 Sep 2022 07:17:13 UTC (1,866 KB)

Computer Science > Computation and Language

Title:Exploring Gender Bias in Retrieval Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Exploring Gender Bias in Retrieval Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators