Policy Iteration for Pareto-Optimal Policies in Stochastic Stackelberg Games

Kudo, Mikoto; Akimoto, Yohei

arXiv:2405.06689 (cs)

[Submitted on 7 May 2024]

Abstract: In general-sum stochastic games, a stationary Stackelberg equilibrium (SSE), in which the leader maximizes its own return for all initial states while the follower plays a best response to the leader's policy, does not always exist. Existing methods for determining SSEs require strong assumptions to guarantee convergence and to guarantee that the limit coincides with an SSE. Moreover, our analysis suggests that the performance at the fixed points of these methods is not reasonable when those fixed points are not SSEs. Herein, we introduce the concept of Pareto-optimality as a reasonable alternative to SSEs. We derive a policy improvement theorem for stochastic games with a best-response follower and propose an iterative algorithm based on it to determine Pareto-optimal policies. We prove monotone improvement and convergence of the proposed approach, and we prove its convergence to SSEs in a special case.

Comments:	21 pages
Subjects:	Computer Science and Game Theory (cs.GT); Machine Learning (cs.LG); Multiagent Systems (cs.MA); Optimization and Control (math.OC)
Cite as:	arXiv:2405.06689 [cs.GT]
	(or arXiv:2405.06689v1 [cs.GT] for this version)
	https://doi.org/10.48550/arXiv.2405.06689

Submission history

From: Mikoto Kudo
[v1] Tue, 7 May 2024 07:40:42 UTC (49 KB)
