Sample-Efficient Reinforcement Learning of Partially Observable Markov Games

Liu, Qinghua; Szepesvári, Csaba; Jin, Chi

arXiv:2206.01315 (cs)

[Submitted on 2 Jun 2022 (v1), last revised 17 Oct 2022 (this version, v2)]

Abstract: This paper considers the challenging tasks of Multi-Agent Reinforcement Learning (MARL) under partial observability, where each agent sees only her own individual observations and actions, which reveal incomplete information about the underlying state of the system. This paper studies these tasks under the general model of multiplayer general-sum Partially Observable Markov Games (POMGs), which is significantly larger than the standard model of Imperfect Information Extensive-Form Games (IIEFGs). We identify a rich subclass of POMGs -- weakly revealing POMGs -- in which sample-efficient learning is tractable. In the self-play setting, we prove that a simple algorithm combining optimism and Maximum Likelihood Estimation (MLE) is sufficient to find approximate Nash equilibria, correlated equilibria, as well as coarse correlated equilibria of weakly revealing POMGs, in a polynomial number of samples when the number of agents is small. In the setting of playing against adversarial opponents, we show that a variant of our optimistic MLE algorithm is capable of achieving sublinear regret when compared against the optimal maximin policies. To the best of our knowledge, this work provides the first line of sample-efficient results for learning POMGs.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Science and Game Theory (cs.GT); Machine Learning (stat.ML)
Cite as:	arXiv:2206.01315 [cs.LG]
	(or arXiv:2206.01315v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2206.01315

Submission history

From: Qinghua Liu
[v1] Thu, 2 Jun 2022 21:57:47 UTC (343 KB)
[v2] Mon, 17 Oct 2022 16:22:14 UTC (385 KB)
