Leveraging Prototypical Representations for Mitigating Social Bias without Demographic Information

Iskander, Shadi; Radinsky, Kira; Belinkov, Yonatan

Computer Science > Computation and Language

arXiv:2403.09516v1 (cs)

[Submitted on 14 Mar 2024 (this version), latest version 5 Apr 2024 (v3)]

Title:Leveraging Prototypical Representations for Mitigating Social Bias without Demographic Information

Authors:Shadi Iskander, Kira Radinsky, Yonatan Belinkov

View PDF HTML (experimental)

Abstract:Mitigating social biases typically requires identifying the social groups associated with each data sample. In this paper, we present DAFair, a novel approach to address social bias in language models. Unlike traditional methods that rely on explicit demographic labels, our approach does not require any such information. Instead, we leverage predefined prototypical demographic texts and incorporate a regularization term during the fine-tuning process to mitigate bias in the model's representations. Our empirical results across two tasks and two models demonstrate the effectiveness of our method compared to previous approaches that do not rely on labeled data. Moreover, with limited demographic-annotated data, our approach outperforms common debiasing approaches.

Subjects:	Computation and Language (cs.CL); Computers and Society (cs.CY); Machine Learning (cs.LG)
Cite as:	arXiv:2403.09516 [cs.CL]
	(or arXiv:2403.09516v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2403.09516

Submission history

From: Shadi Iskander [view email]
[v1] Thu, 14 Mar 2024 15:58:36 UTC (8,192 KB)
[v2] Tue, 2 Apr 2024 14:14:24 UTC (9,877 KB)
[v3] Fri, 5 Apr 2024 18:35:37 UTC (9,890 KB)

Computer Science > Computation and Language

Title:Leveraging Prototypical Representations for Mitigating Social Bias without Demographic Information

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Leveraging Prototypical Representations for Mitigating Social Bias without Demographic Information

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators