The Danger Of Arrogance: Welfare Equilibra As A Solution To Stackelberg Self-Play In Non-Coincidental Games

Levi, Jake; Lu, Chris; Willi, Timon; de Witt, Christian Schroeder; Foerster, Jakob

arXiv:2402.01088 (cs)

[Submitted on 2 Feb 2024 (v1), last revised 28 Mar 2024 (this version, v2)]

Authors:Jake Levi, Chris Lu, Timon Willi, Christian Schroeder de Witt, Jakob Foerster

Abstract: The increasing prevalence of multi-agent learning systems in society necessitates understanding how to learn effective and safe policies in general-sum multi-agent environments against a variety of opponents, including self-play. General-sum learning is difficult because of non-stationary opponents and misaligned incentives. Our first main contribution is to show that many recent approaches to general-sum learning can be derived as approximations to Stackelberg strategies, which suggests a framework for developing new multi-agent learning algorithms. We then define non-coincidental games as games in which the Stackelberg strategy profile is not a Nash Equilibrium. This notably includes several canonical matrix games and provides a normative theory for why existing algorithms fail in self-play in such games. We address this problem by introducing Welfare Equilibria (WE) as a generalisation of Stackelberg Strategies, which can recover desirable Nash Equilibria even in non-coincidental games. Finally, we introduce Welfare Function Search (WelFuSe) as a practical approach to finding desirable WE against unknown opponents, which finds more mutually desirable solutions in self-play, while preserving performance against naive learning opponents.
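
The definition of a non-coincidental game can be illustrated with a small sketch. It is not the paper's method: it assumes one-shot two-player matrix games, pure strategies only, and a follower who breaks ties in the leader's favour. All function names and payoff values below are hypothetical and chosen for illustration. Each player computes the action it would commit to as a Stackelberg leader; the game is flagged as non-coincidental (under these assumptions) if the resulting profile, with both players acting as leader, is not a pure Nash Equilibrium.

```python
from typing import List

Matrix = List[List[float]]  # payoff[i][j]: row player plays i, column player plays j


def transpose(m: Matrix) -> Matrix:
    return [list(col) for col in zip(*m)]


def stackelberg_action(leader: Matrix, follower: Matrix) -> int:
    """Pure action a leader commits to; both matrices are indexed [leader][follower].

    Assumption: the follower best-responds and breaks ties in the leader's favour.
    """
    best_action, best_value = 0, float("-inf")
    for i, follower_row in enumerate(follower):
        top = max(follower_row)
        responses = [j for j, v in enumerate(follower_row) if v == top]
        value = max(leader[i][j] for j in responses)
        if value > best_value:
            best_action, best_value = i, value
    return best_action


def is_pure_nash(a: Matrix, b: Matrix, i: int, j: int) -> bool:
    """True if neither player gains by unilaterally deviating from (i, j)."""
    row_ok = all(a[i][j] >= a[k][j] for k in range(len(a)))
    col_ok = all(b[i][j] >= b[i][l] for l in range(len(b[i])))
    return row_ok and col_ok


def is_non_coincidental(a: Matrix, b: Matrix) -> bool:
    """Both players act as Stackelberg leader; check whether that profile is a Nash Equilibrium."""
    i = stackelberg_action(a, b)                        # row player leads
    j = stackelberg_action(transpose(b), transpose(a))  # column player leads
    return not is_pure_nash(a, b, i, j)


# Illustrative payoffs (assumed values). Chicken: action 0 = swerve, 1 = straight.
chicken_a = [[0, -1], [1, -10]]
chicken_b = [[0, 1], [-1, -10]]
# Prisoner's Dilemma: action 0 = cooperate, 1 = defect.
pd_a = [[-1, -3], [0, -2]]
pd_b = [[-1, 0], [-3, -2]]

print(is_non_coincidental(chicken_a, chicken_b))
print(is_non_coincidental(pd_a, pd_b))
```

With these payoffs, both Chicken players commit to going straight, each expecting the other to swerve, and the resulting crash is not a Nash Equilibrium. In the one-shot Prisoner's Dilemma, both leaders defect, which coincides with the Nash Equilibrium. The paper's own setting may differ from this pure-strategy, one-shot simplification.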

Comments:	31 pages, 23 figures
Subjects:	Computer Science and Game Theory (cs.GT); Multiagent Systems (cs.MA)
Cite as:	arXiv:2402.01088 [cs.GT]
	(or arXiv:2402.01088v2 [cs.GT] for this version)
	https://doi.org/10.48550/arXiv.2402.01088

Submission history

From: Jake Levi
[v1] Fri, 2 Feb 2024 01:09:39 UTC (4,980 KB)
[v2] Thu, 28 Mar 2024 02:37:27 UTC (5,230 KB)
