Decentralized Transformers with Centralized Aggregation are Sample-Efficient Multi-Agent World Models

Zhang, Yang; Bai, Chenjia; Zhao, Bin; Yan, Junchi; Li, Xiu; Li, Xuelong

Computer Science > Machine Learning

arXiv:2406.15836 (cs)

[Submitted on 22 Jun 2024]

Title:Decentralized Transformers with Centralized Aggregation are Sample-Efficient Multi-Agent World Models

Authors:Yang Zhang, Chenjia Bai, Bin Zhao, Junchi Yan, Xiu Li, Xuelong Li

View PDF HTML (experimental)

Abstract:Learning a world model for model-free Reinforcement Learning (RL) agents can significantly improve the sample efficiency by learning policies in imagination. However, building a world model for Multi-Agent RL (MARL) can be particularly challenging due to the scalability issue in a centralized architecture arising from a large number of agents, and also the non-stationarity issue in a decentralized architecture stemming from the inter-dependency among agents. To address both challenges, we propose a novel world model for MARL that learns decentralized local dynamics for scalability, combined with a centralized representation aggregation from all agents. We cast the dynamics learning as an auto-regressive sequence modeling problem over discrete tokens by leveraging the expressive Transformer architecture, in order to model complex local dynamics across different agents and provide accurate and consistent long-term imaginations. As the first pioneering Transformer-based world model for multi-agent systems, we introduce a Perceiver Transformer as an effective solution to enable centralized representation aggregation within this context. Results on Starcraft Multi-Agent Challenge (SMAC) show that it outperforms strong model-free approaches and existing model-based methods in both sample efficiency and overall performance.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA)
Cite as:	arXiv:2406.15836 [cs.LG]
	(or arXiv:2406.15836v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2406.15836

Submission history

From: Yang Zhang [view email]
[v1] Sat, 22 Jun 2024 12:40:03 UTC (1,134 KB)

Computer Science > Machine Learning

Title:Decentralized Transformers with Centralized Aggregation are Sample-Efficient Multi-Agent World Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Decentralized Transformers with Centralized Aggregation are Sample-Efficient Multi-Agent World Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators