Finite-Sample Analysis of Decentralized Q-Learning for Stochastic Games

Gao, Zuguang; Ma, Qianqian; Başar, Tamer; Birge, John R.

Computer Science > Computer Science and Game Theory

arXiv:2112.07859 (cs)

[Submitted on 15 Dec 2021 (v1), last revised 16 Dec 2021 (this version, v2)]

Title:Finite-Sample Analysis of Decentralized Q-Learning for Stochastic Games

Authors:Zuguang Gao, Qianqian Ma, Tamer Başar, John R. Birge

View PDF

Abstract:Learning in stochastic games is arguably the most standard and fundamental setting in multi-agent reinforcement learning (MARL). In this paper, we consider decentralized MARL in stochastic games in the non-asymptotic regime. In particular, we establish the finite-sample complexity of fully decentralized Q-learning algorithms in a significant class of general-sum stochastic games (SGs) - weakly acyclic SGs, which includes the common cooperative MARL setting with an identical reward to all agents (a Markov team problem) as a special case. We focus on the practical while challenging setting of fully decentralized MARL, where neither the rewards nor the actions of other agents can be observed by each agent. In fact, each agent is completely oblivious to the presence of other decision makers. Both the tabular and the linear function approximation cases have been considered. In the tabular setting, we analyze the sample complexity for the decentralized Q-learning algorithm to converge to a Markov perfect equilibrium (Nash equilibrium). With linear function approximation, the results are for convergence to a linear approximated equilibrium - a new notion of equilibrium that we propose - which describes that each agent's policy is a best reply (to other agents) within a linear space. Numerical experiments are also provided for both settings to demonstrate the results.

Subjects:	Computer Science and Game Theory (cs.GT); Machine Learning (cs.LG); Multiagent Systems (cs.MA)
Cite as:	arXiv:2112.07859 [cs.GT]
	(or arXiv:2112.07859v2 [cs.GT] for this version)
	https://doi.org/10.48550/arXiv.2112.07859

Submission history

From: Zuguang Gao [view email]
[v1] Wed, 15 Dec 2021 03:33:39 UTC (2,060 KB)
[v2] Thu, 16 Dec 2021 18:14:38 UTC (2,274 KB)

Computer Science > Computer Science and Game Theory

Title:Finite-Sample Analysis of Decentralized Q-Learning for Stochastic Games

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Science and Game Theory

Title:Finite-Sample Analysis of Decentralized Q-Learning for Stochastic Games

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators