Steering No-Regret Learners to a Desired Equilibrium

Zhang, Brian Hu; Farina, Gabriele; Anagnostides, Ioannis; Cacciamani, Federico; McAleer, Stephen Marcus; Haupt, Andreas Alexander; Celli, Andrea; Gatti, Nicola; Conitzer, Vincent; Sandholm, Tuomas

arXiv:2306.05221 (cs)

[Submitted on 8 Jun 2023 (v1), last revised 17 Feb 2024 (this version, v4)]

Abstract: A mediator observes no-regret learners playing an extensive-form game repeatedly across $T$ rounds. The mediator attempts to steer players toward some desirable predetermined equilibrium by giving (nonnegative) payments to players. We call this the steering problem. The steering problem captures several problems of interest, among them equilibrium selection and information design (persuasion). If the mediator's budget is unbounded, steering is trivial because the mediator can simply pay the players to play desirable actions. We study two bounds on the mediator's payments: a total budget and a per-round budget. If the mediator's total budget does not grow with $T$, we show that steering is impossible. However, we show that it is enough for the total budget to grow sublinearly with $T$, that is, for the average payment to vanish. When players' full strategies are observed at each round, we show that constant per-round budgets permit steering. In the more challenging setting where only trajectories through the game tree are observable, we show that steering is impossible with constant per-round budgets in general extensive-form games, but possible in normal-form games or if the per-round budget may itself depend on $T$. We also show how our results can be generalized to the case when the equilibrium is being computed online while steering is happening. We supplement our theoretical positive results with experiments highlighting the efficacy of steering in large games.
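
To make the setting concrete, here is a minimal toy sketch of steering in a two-player normal-form game where the mediator observes full mixed strategies. Everything in it is an illustrative assumption rather than the paper's construction: the stag hunt payoffs, the Hedge (multiplicative-weights) learners, the payment rule, and the names `PAYOFF`, `hedge_probs`, `simulate`, `alpha`, `eta`, and `init_bias` are all hypothetical. The payment to each player is `alpha` times the probability that the player chooses the target action (Stag) times the probability that the opponent does not, so it is nonnegative, bounded by `alpha` in every round, and zero once both players play the target.

```python
import math

# Toy stag hunt (an illustrative assumption, not a game from the paper).
# Actions: 0 = Stag (target), 1 = Hare. PAYOFF[a][b] is the payoff to a
# player choosing a when the other player chooses b.
PAYOFF = [[1.0, 0.0],
          [0.5, 0.5]]


def hedge_probs(cum_utils, eta):
    """Hedge (multiplicative-weights) distribution from cumulative utilities."""
    m = max(cum_utils)  # subtract the max for numerical stability
    weights = [math.exp(eta * (u - m)) for u in cum_utils]
    total = sum(weights)
    return [w / total for w in weights]


def simulate(T, alpha, eta=0.1, init_bias=5.0):
    """Run two Hedge learners for T rounds.

    alpha is the hypothetical mediator's per-round payment scale; alpha = 0
    means no steering. Both learners start biased toward Hare.
    Returns (final probability of Stag for each player, total payment).
    """
    cum = [[0.0, init_bias], [0.0, init_bias]]
    total_payment = 0.0
    for _ in range(T):
        # Both players commit to mixed strategies before any update.
        x = [hedge_probs(cum[0], eta), hedge_probs(cum[1], eta)]
        for i in (0, 1):
            q = x[1 - i][0]  # opponent's probability of Stag
            # Expected game utility of each pure action against the opponent.
            util = [q * PAYOFF[a][0] + (1.0 - q) * PAYOFF[a][1] for a in (0, 1)]
            # Payment: alpha * Pr[i plays Stag] * Pr[opponent plays Hare].
            # As a utility bonus, this adds alpha * (1 - q) to action Stag.
            util[0] += alpha * (1.0 - q)
            total_payment += alpha * x[i][0] * (1.0 - q)
            for a in (0, 1):
                cum[i][a] += util[a]
    final = [hedge_probs(cum[0], eta)[0], hedge_probs(cum[1], eta)[0]]
    return final, total_payment


if __name__ == "__main__":
    for alpha in (0.0, 1.0):
        final, paid = simulate(T=2000, alpha=alpha)
        print(f"alpha={alpha}: Pr[Stag]={final}, average payment={paid / 2000:.4f}")
```

With `alpha = 0.0` the learners in this toy drift to the Hare equilibrium. With `alpha = 1.0` Stag becomes strictly better than Hare for each learner regardless of the opponent, both learners move to Stag, and the payments shrink as they do, so the average payment per round becomes small. This only illustrates the flavor of the full-strategy, normal-form case; it says nothing about the trajectory-feedback or extensive-form settings discussed in the abstract.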

Subjects:	Computer Science and Game Theory (cs.GT)
Cite as:	arXiv:2306.05221 [cs.GT]
	(or arXiv:2306.05221v4 [cs.GT] for this version)
	https://doi.org/10.48550/arXiv.2306.05221

Submission history

From: Brian Zhang
[v1] Thu, 8 Jun 2023 14:18:46 UTC (18,099 KB)
[v2] Sun, 8 Oct 2023 23:33:39 UTC (18,128 KB)
[v3] Thu, 15 Feb 2024 02:23:35 UTC (18,141 KB)
[v4] Sat, 17 Feb 2024 22:53:56 UTC (18,142 KB)
