"Other-Play" for Zero-Shot Coordination

Hu, Hengyuan; Lerer, Adam; Peysakhovich, Alex; Foerster, Jakob

Computer Science > Artificial Intelligence

arXiv:2003.02979 (cs)

[Submitted on 6 Mar 2020 (v1), last revised 12 May 2021 (this version, v3)]

Title:"Other-Play" for Zero-Shot Coordination

Authors:Hengyuan Hu, Adam Lerer, Alex Peysakhovich, Jakob Foerster

View PDF

Abstract:We consider the problem of zero-shot coordination - constructing AI agents that can coordinate with novel partners they have not seen before (e.g. humans). Standard Multi-Agent Reinforcement Learning (MARL) methods typically focus on the self-play (SP) setting where agents construct strategies by playing the game with themselves repeatedly. Unfortunately, applying SP naively to the zero-shot coordination problem can produce agents that establish highly specialized conventions that do not carry over to novel partners they have not been trained with. We introduce a novel learning algorithm called other-play (OP), that enhances self-play by looking for more robust strategies, exploiting the presence of known symmetries in the underlying problem. We characterize OP theoretically as well as experimentally. We study the cooperative card game Hanabi and show that OP agents achieve higher scores when paired with independently trained agents. In preliminary results we also show that our OP agents obtains higher average scores when paired with human players, compared to state-of-the-art SP agents.

Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:2003.02979 [cs.AI]
	(or arXiv:2003.02979v3 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2003.02979

Submission history

From: Hengyuan Hu [view email]
[v1] Fri, 6 Mar 2020 00:39:37 UTC (1,052 KB)
[v2] Mon, 9 Mar 2020 17:58:40 UTC (1,052 KB)
[v3] Wed, 12 May 2021 05:22:20 UTC (1,865 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.AI

< prev | next >

new | recent | 2020-03

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Hengyuan Hu
Adam Lerer
Alex Peysakhovich
Jakob N. Foerster

export BibTeX citation

Computer Science > Artificial Intelligence

Title:"Other-Play" for Zero-Shot Coordination

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:"Other-Play" for Zero-Shot Coordination

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators