Mitigating Relative Over-Generalization in Multi-Agent Reinforcement Learning

Zhu, Ting; Jin, Yue; Houssineau, Jeremie; Montana, Giovanni

Computer Science > Machine Learning

arXiv:2411.11099 (cs)

[Submitted on 17 Nov 2024]

Title:Mitigating Relative Over-Generalization in Multi-Agent Reinforcement Learning

Authors:Ting Zhu, Yue Jin, Jeremie Houssineau, Giovanni Montana

View PDF HTML (experimental)

Abstract:In decentralized multi-agent reinforcement learning, agents learning in isolation can lead to relative over-generalization (RO), where optimal joint actions are undervalued in favor of suboptimal ones. This hinders effective coordination in cooperative tasks, as agents tend to choose actions that are individually rational but collectively suboptimal. To address this issue, we introduce MaxMax Q-Learning (MMQ), which employs an iterative process of sampling and evaluating potential next states, selecting those with maximal Q-values for learning. This approach refines approximations of ideal state transitions, aligning more closely with the optimal joint policy of collaborating agents. We provide theoretical analysis supporting MMQ's potential and present empirical evaluations across various environments susceptible to RO. Our results demonstrate that MMQ frequently outperforms existing baselines, exhibiting enhanced convergence and sample efficiency.

Comments:	Published in Transactions on Machine Learning Research (11/2024)
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as:	arXiv:2411.11099 [cs.LG]
	(or arXiv:2411.11099v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2411.11099

Submission history

From: Giovanni Montana [view email]
[v1] Sun, 17 Nov 2024 15:00:39 UTC (28,019 KB)

Computer Science > Machine Learning

Title:Mitigating Relative Over-Generalization in Multi-Agent Reinforcement Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Mitigating Relative Over-Generalization in Multi-Agent Reinforcement Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators