Numerical analysis of a reinforcement learning model with the dynamic aspiration level in the iterated Prisoner's Dilemma

Masuda, Naoki; Nakamura, Mitsuhiro

doi:10.1016/j.jtbi.2011.03.005

Quantitative Biology > Populations and Evolution

arXiv:1012.0121 (q-bio)

[Submitted on 1 Dec 2010 (v1), last revised 3 Apr 2011 (this version, v2)]

Title:Numerical analysis of a reinforcement learning model with the dynamic aspiration level in the iterated Prisoner's Dilemma

Authors:Naoki Masuda, Mitsuhiro Nakamura

View PDF

Abstract:Humans and other animals can adapt their social behavior in response to environmental cues including the feedback obtained through experience. Nevertheless, the effects of the experience-based learning of players in evolution and maintenance of cooperation in social dilemma games remain relatively unclear. Some previous literature showed that mutual cooperation of learning players is difficult or requires a sophisticated learning model. In the context of the iterated Prisoner's Dilemma, we numerically examine the performance of a reinforcement learning model. Our model modifies those of Karandikar et al. (1998), Posch et al. (1999), and Macy and Flache (2002) in which players satisfice if the obtained payoff is larger than a dynamic threshold. We show that players obeying the modified learning mutually cooperate with high probability if the dynamics of threshold is not too fast and the association between the reinforcement signal and the action in the next round is sufficiently strong. The learning players also perform efficiently against the reactive strategy. In evolutionary dynamics, they can invade a population of players adopting simpler but competitive strategies. Our version of the reinforcement learning model does not complicate the previous model and is sufficiently simple yet flexible. It may serve to explore the relationships between learning and evolution in social dilemma situations.

Comments:	7 figures
Subjects:	Populations and Evolution (q-bio.PE)
Cite as:	arXiv:1012.0121 [q-bio.PE]
	(or arXiv:1012.0121v2 [q-bio.PE] for this version)
	https://doi.org/10.48550/arXiv.1012.0121
Journal reference:	Journal of Theoretical Biology, 278, 55-62 (2011)
Related DOI:	https://doi.org/10.1016/j.jtbi.2011.03.005

Submission history

From: Naoki Masuda Dr. [view email]
[v1] Wed, 1 Dec 2010 07:37:36 UTC (242 KB)
[v2] Sun, 3 Apr 2011 10:00:10 UTC (268 KB)

Quantitative Biology > Populations and Evolution

Title:Numerical analysis of a reinforcement learning model with the dynamic aspiration level in the iterated Prisoner's Dilemma

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Quantitative Biology > Populations and Evolution

Title:Numerical analysis of a reinforcement learning model with the dynamic aspiration level in the iterated Prisoner's Dilemma

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators