HiMAP: Learning Heuristics-Informed Policies for Large-Scale Multi-Agent Pathfinding

Tang, Huijie; Berto, Federico; Ma, Zihan; Hua, Chuanbo; Ahn, Kyuree; Park, Jinkyoo

Computer Science > Multiagent Systems

arXiv:2402.15546 (cs)

[Submitted on 23 Feb 2024]

Title:HiMAP: Learning Heuristics-Informed Policies for Large-Scale Multi-Agent Pathfinding

Authors:Huijie Tang, Federico Berto, Zihan Ma, Chuanbo Hua, Kyuree Ahn, Jinkyoo Park

View PDF HTML (experimental)

Abstract:Large-scale multi-agent pathfinding (MAPF) presents significant challenges in several areas. As systems grow in complexity with a multitude of autonomous agents operating simultaneously, efficient and collision-free coordination becomes paramount. Traditional algorithms often fall short in scalability, especially in intricate scenarios. Reinforcement Learning (RL) has shown potential to address the intricacies of MAPF; however, it has also been shown to struggle with scalability, demanding intricate implementation, lengthy training, and often exhibiting unstable convergence, limiting its practical application. In this paper, we introduce Heuristics-Informed Multi-Agent Pathfinding (HiMAP), a novel scalable approach that employs imitation learning with heuristic guidance in a decentralized manner. We train on small-scale instances using a heuristic policy as a teacher that maps each single agent observation information to an action probability distribution. During pathfinding, we adopt several inference techniques to improve performance. With a simple training scheme and implementation, HiMAP demonstrates competitive results in terms of success rate and scalability in the field of imitation-learning-only MAPF, showing the potential of imitation-learning-only MAPF equipped with inference techniques.

Comments:	Accepted as Extended Abstract in Proc. of the 23rd International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2024)
Subjects:	Multiagent Systems (cs.MA); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
Cite as:	arXiv:2402.15546 [cs.MA]
	(or arXiv:2402.15546v1 [cs.MA] for this version)
	https://doi.org/10.48550/arXiv.2402.15546

Submission history

From: Federico Berto [view email]
[v1] Fri, 23 Feb 2024 13:01:13 UTC (201 KB)

Computer Science > Multiagent Systems

Title:HiMAP: Learning Heuristics-Informed Policies for Large-Scale Multi-Agent Pathfinding

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Multiagent Systems

Title:HiMAP: Learning Heuristics-Informed Policies for Large-Scale Multi-Agent Pathfinding

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators