Nicer Than Humans: How do Large Language Models Behave in the Prisoner's Dilemma?

Fontana, Nicoló; Pierri, Francesco; Aiello, Luca Maria

Computer Science > Computers and Society

arXiv:2406.13605 (cs)

[Submitted on 19 Jun 2024 (v1), last revised 19 Sep 2024 (this version, v2)]

Title:Nicer Than Humans: How do Large Language Models Behave in the Prisoner's Dilemma?

Authors:Nicoló Fontana, Francesco Pierri, Luca Maria Aiello

View PDF HTML (experimental)

Abstract:The behavior of Large Language Models (LLMs) as artificial social agents is largely unexplored, and we still lack extensive evidence of how these agents react to simple social stimuli. Testing the behavior of AI agents in classic Game Theory experiments provides a promising theoretical framework for evaluating the norms and values of these agents in archetypal social situations. In this work, we investigate the cooperative behavior of three LLMs (Llama2, Llama3, and GPT3.5) when playing the Iterated Prisoner's Dilemma against random adversaries displaying various levels of hostility. We introduce a systematic methodology to evaluate an LLM's comprehension of the game rules and its capability to parse historical gameplay logs for decision-making. We conducted simulations of games lasting for 100 rounds and analyzed the LLMs' decisions in terms of dimensions defined in the behavioral economics literature. We find that all models tend not to initiate defection but act cautiously, favoring cooperation over defection only when the opponent's defection rate is low. Overall, LLMs behave at least as cooperatively as the typical human player, although our results indicate some substantial differences among models. In particular, Llama2 and GPT3.5 are more cooperative than humans, and especially forgiving and non-retaliatory for opponent defection rates below 30%. More similar to humans, Llama3 exhibits consistently uncooperative and exploitative behavior unless the opponent always cooperates. Our systematic approach to the study of LLMs in game theoretical scenarios is a step towards using these simulations to inform practices of LLM auditing and alignment.

Comments:	v1: 9 pages, 8 figures, 1 table v2: 11 pages, 14 figures, 1 table. Increased number of models studied, expanded results and conclusion, added references, corrected typos
Subjects:	Computers and Society (cs.CY); Artificial Intelligence (cs.AI); Computer Science and Game Theory (cs.GT); Physics and Society (physics.soc-ph)
Cite as:	arXiv:2406.13605 [cs.CY]
	(or arXiv:2406.13605v2 [cs.CY] for this version)
	https://doi.org/10.48550/arXiv.2406.13605

Submission history

From: Nicolò Fontana [view email]
[v1] Wed, 19 Jun 2024 14:51:14 UTC (69 KB)
[v2] Thu, 19 Sep 2024 15:19:58 UTC (125 KB)

Computer Science > Computers and Society

Title:Nicer Than Humans: How do Large Language Models Behave in the Prisoner's Dilemma?

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computers and Society

Title:Nicer Than Humans: How do Large Language Models Behave in the Prisoner's Dilemma?

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators