Distributive Fairness in Large Language Models: Evaluating Alignment with Human Values

Hosseini, Hadi; Khanna, Samarth

arXiv:2502.00313 (cs)

[Submitted on 1 Feb 2025]

Abstract: The growing interest in employing large language models (LLMs) for decision-making in social and economic contexts has raised questions about their potential to function as agents in these domains. A significant number of societal problems involve the distribution of resources, where fairness, along with economic efficiency, plays a critical role in the desirability of outcomes. In this paper, we examine whether LLM responses adhere to fundamental fairness concepts such as equitability, envy-freeness, and Rawlsian maximin, and investigate their alignment with human preferences. We evaluate the performance of several LLMs, providing a comparative benchmark of their ability to reflect these measures. Our results demonstrate a lack of alignment between current LLM responses and human distributional preferences. Moreover, LLMs are unable to utilize money as a transferable resource to mitigate inequality. Nonetheless, we demonstrate a stark contrast when (some) LLMs are tasked with selecting from a predefined menu of options rather than generating one. In addition, we analyze the robustness of LLM responses to variations in semantic factors (e.g., intentions or personas) and non-semantic prompting changes (e.g., templates or orderings). Finally, we highlight potential strategies aimed at enhancing the alignment of LLM behavior with well-established fairness concepts.
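For readers unfamiliar with the fairness concepts named in the abstract, the following is a minimal illustrative sketch, not the paper's evaluation code. It assumes additive valuations over indivisible goods, which the abstract does not specify, and all names (`bundle_value`, `is_envy_free`, `is_equitable`, `min_utility`) and the example numbers are hypothetical.

```python
from typing import Dict, List

# Assumption: additive valuations, i.e. a bundle is worth the sum of its goods.
Valuations = List[Dict[str, float]]  # valuations[i][g] = agent i's value for good g
Allocation = List[List[str]]         # allocation[i] = goods assigned to agent i


def bundle_value(valuations: Valuations, agent: int, bundle: List[str]) -> float:
    """Value that `agent` assigns to `bundle` under additive valuations."""
    return sum(valuations[agent][g] for g in bundle)


def is_envy_free(valuations: Valuations, allocation: Allocation) -> bool:
    """Envy-freeness: no agent values another agent's bundle above its own."""
    n = len(allocation)
    return all(
        bundle_value(valuations, i, allocation[i])
        >= bundle_value(valuations, i, allocation[j])
        for i in range(n)
        for j in range(n)
    )


def is_equitable(valuations: Valuations, allocation: Allocation,
                 tol: float = 1e-9) -> bool:
    """Equitability: every agent derives the same value from its own bundle."""
    own = [bundle_value(valuations, i, b) for i, b in enumerate(allocation)]
    return max(own) - min(own) <= tol


def min_utility(valuations: Valuations, allocation: Allocation) -> float:
    """Utility of the worst-off agent; Rawlsian maximin prefers larger values."""
    return min(bundle_value(valuations, i, b) for i, b in enumerate(allocation))


# Hypothetical instance: two agents, three goods.
valuations = [
    {"a": 6, "b": 3, "c": 1},
    {"a": 2, "b": 4, "c": 4},
]
alloc_a = [["a"], ["b", "c"]]
alloc_b = [["a", "b"], ["c"]]

for name, alloc in [("A", alloc_a), ("B", alloc_b)]:
    print(name,
          "envy-free:", is_envy_free(valuations, alloc),
          "equitable:", is_equitable(valuations, alloc),
          "min utility:", min_utility(valuations, alloc))
```

In this toy instance, allocation A is envy-free but not equitable, and it has the higher minimum utility of the two candidates, so a maximin rule restricted to these two options would pick A.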

Subjects:	Computer Science and Game Theory (cs.GT); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Multiagent Systems (cs.MA)
Cite as:	arXiv:2502.00313 [cs.GT]
	(or arXiv:2502.00313v1 [cs.GT] for this version)
	https://doi.org/10.48550/arXiv.2502.00313

Submission history

From: Samarth Khanna
[v1] Sat, 1 Feb 2025 04:24:47 UTC (1,941 KB)
