Rethinking Individual Global Max in Cooperative Multi-Agent Reinforcement Learning

Hong, Yitian; Jin, Yaochu; Tang, Yang

Computer Science > Multiagent Systems

arXiv:2209.09640 (cs)

[Submitted on 20 Sep 2022]

Title:Rethinking Individual Global Max in Cooperative Multi-Agent Reinforcement Learning

Authors:Yitian Hong, Yaochu Jin, Yang Tang

View PDF

Abstract:In cooperative multi-agent reinforcement learning, centralized training and decentralized execution (CTDE) has achieved remarkable success. Individual Global Max (IGM) decomposition, which is an important element of CTDE, measures the consistency between local and joint policies. The majority of IGM-based research focuses on how to establish this consistent relationship, but little attention has been paid to examining IGM's potential flaws. In this work, we reveal that the IGM condition is a lossy decomposition, and the error of lossy decomposition will accumulated in hypernetwork-based methods. To address the above issue, we propose to adopt an imitation learning strategy to separate the lossy decomposition from Bellman iterations, thereby avoiding error accumulation. The proposed strategy is theoretically proved and empirically verified on the StarCraft Multi-Agent Challenge benchmark problem with zero sight view. The results also confirm that the proposed method outperforms state-of-the-art IGM-based approaches.

Comments:	Accept at NeurIPS 2022
Subjects:	Multiagent Systems (cs.MA)
Cite as:	arXiv:2209.09640 [cs.MA]
	(or arXiv:2209.09640v1 [cs.MA] for this version)
	https://doi.org/10.48550/arXiv.2209.09640

Submission history

From: Yitian Hong [view email]
[v1] Tue, 20 Sep 2022 11:38:50 UTC (8,868 KB)

Computer Science > Multiagent Systems

Title:Rethinking Individual Global Max in Cooperative Multi-Agent Reinforcement Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Multiagent Systems

Title:Rethinking Individual Global Max in Cooperative Multi-Agent Reinforcement Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators