Steering No-Regret Learners to Optimal Equilibria

Zhang, Brian Hu; Farina, Gabriele; Anagnostides, Ioannis; Cacciamani, Federico; McAleer, Stephen Marcus; Haupt, Andreas Alexander; Celli, Andrea; Gatti, Nicola; Conitzer, Vincent; Sandholm, Tuomas

Computer Science > Computer Science and Game Theory

arXiv:2306.05221v1 (cs)

[Submitted on 8 Jun 2023 (this version), latest version 17 Feb 2024 (v4)]

Title:Steering No-Regret Learners to Optimal Equilibria

Authors:Brian Hu Zhang, Gabriele Farina, Ioannis Anagnostides, Federico Cacciamani, Stephen Marcus McAleer, Andreas Alexander Haupt, Andrea Celli, Nicola Gatti, Vincent Conitzer, Tuomas Sandholm

View PDF

Abstract:We consider the problem of steering no-regret-learning agents to play desirable equilibria in extensive-form games via nonnegative payments. We show that steering is impossible if the total budget (across iterations) is finite. However, with average, realized payments converging to zero, we show that steering is possible. In the full-feedback setting, that is, when players' full strategies are observed at each timestep, it is possible with constant per-iteration payments. In the bandit-feedback setting, that is, when only trajectories through the game tree are observable, steering is impossible with constant per-iteration payments but possible if we allow the maximum per-iteration payment to grow with time, while maintaining the property that average, realized payments vanish. We supplement our theoretical positive results with experiments highlighting the efficacy of steering in large, extensive-form games, and show how our framework relates to optimal mechanism design and information design.

Subjects:	Computer Science and Game Theory (cs.GT)
Cite as:	arXiv:2306.05221 [cs.GT]
	(or arXiv:2306.05221v1 [cs.GT] for this version)
	https://doi.org/10.48550/arXiv.2306.05221

Submission history

From: Brian Zhang [view email]
[v1] Thu, 8 Jun 2023 14:18:46 UTC (18,099 KB)
[v2] Sun, 8 Oct 2023 23:33:39 UTC (18,128 KB)
[v3] Thu, 15 Feb 2024 02:23:35 UTC (18,141 KB)
[v4] Sat, 17 Feb 2024 22:53:56 UTC (18,142 KB)

Computer Science > Computer Science and Game Theory

Title:Steering No-Regret Learners to Optimal Equilibria

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Science and Game Theory

Title:Steering No-Regret Learners to Optimal Equilibria

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators