Variational Offline Multi-agent Skill Discovery

Chen, Jiayu; Ganguly, Bhargav; Lan, Tian; Aggarwal, Vaneet

arXiv:2405.16386 (cs)

[Submitted on 26 May 2024 (v1), last revised 15 Oct 2024 (this version, v2)]

Abstract: Skills are effective temporal abstractions for sequential decision making: they enable efficient hierarchical learning for long-horizon tasks and facilitate multi-task learning through their transferability. Despite extensive research, gaps remain in multi-agent scenarios, particularly in automatically extracting subgroup coordination patterns in a multi-agent task. To address this, we propose two novel auto-encoder schemes, VO-MASD-3D and VO-MASD-Hier, which simultaneously capture subgroup-level and temporal-level abstractions to form multi-agent skills, and which are the first to solve the aforementioned challenge. An essential algorithmic component of these schemes is a dynamic grouping function that automatically detects latent subgroups based on agent interactions in a task. Our method can be applied to offline multi-task data, and the discovered subgroup skills can be transferred across relevant tasks without retraining. Empirical evaluations on StarCraft tasks indicate that our approach significantly outperforms existing hierarchical multi-agent reinforcement learning (MARL) methods. Moreover, skills discovered using our method can effectively reduce the learning difficulty in MARL scenarios with delayed and sparse reward signals.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2405.16386 [cs.LG]
	(or arXiv:2405.16386v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2405.16386

Submission history

From: Jiayu Chen
[v1] Sun, 26 May 2024 00:24:46 UTC (846 KB)
[v2] Tue, 15 Oct 2024 04:08:33 UTC (6,736 KB)