Uncovering Name-Based Biases in Large Language Models Through Simulated Trust Game

Wei, Yumou; Carvalho, Paulo F.; Stamper, John

arXiv:2404.14682 (cs)

[Submitted on 23 Apr 2024]

Abstract: Gender and race inferred from an individual's name are a notable source of stereotypes and biases that subtly influence social interactions. Abundant evidence from human experiments has revealed the preferential treatment that one receives when one's name suggests a predominant gender or race. As large language models acquire more capabilities and begin to support everyday applications, it becomes crucial to examine whether they manifest similar biases when encountering names in a complex social interaction. In contrast to previous work that studies name-based biases in language models at a more fundamental level, such as word representations, we challenge three prominent models to predict the outcome of a modified Trust Game, a well-publicized paradigm for studying trust and reciprocity. To ensure the internal validity of our experiments, we have carefully curated a list of racially representative surnames to identify players in a Trust Game and rigorously verified the construct validity of our prompts. The results of our experiments show that our approach can detect name-based biases in both base and instruction-tuned models.

Subjects:	Computers and Society (cs.CY)
Cite as:	arXiv:2404.14682 [cs.CY]
	(or arXiv:2404.14682v1 [cs.CY] for this version)
	https://doi.org/10.48550/arXiv.2404.14682

Submission history

From: Yumou Wei
[v1] Tue, 23 Apr 2024 02:21:17 UTC (109 KB)
