Distributed Online Rollout for Multivehicle Routing in Unmapped Environments

Weber, Jamison W.; Giriyan, Dhanush R.; Parkar, Devendra R.; Bertsekas, Dimitri P.; Richa, Andréa W.

arXiv:2305.15596 (cs)

[Submitted on 24 May 2023 (v1), last revised 24 Feb 2024 (this version, v3)]

Abstract: In this work we consider a generalization of the well-known multivehicle routing problem: given a network, a set of agents occupying a subset of its nodes, and a set of tasks, we seek a minimum cost sequence of movements subject to the constraint that each task is visited by some agent at least once. The classical version of this problem assumes a central computational server that observes the entire state of the system perfectly and directs individual agents according to a centralized control scheme. In contrast, we assume that there is no centralized server and that each agent is an individual processor with no a priori knowledge of the underlying network (including task and agent locations). Moreover, our agents possess strictly local communication and sensing capabilities (restricted to a fixed radius around their respective locations), aligning more closely with several real-world multiagent applications. These restrictions introduce many challenges that are overcome through local information sharing and direct coordination between agents. We present a fully distributed, online, and scalable reinforcement learning algorithm for this problem whereby agents self-organize into local clusters and independently apply a multiagent rollout scheme locally to each cluster. We demonstrate empirically via extensive simulations that there exists a critical sensing radius beyond which the distributed rollout algorithm begins to improve over a greedy base policy. This critical sensing radius grows proportionally to the $\log^*$ function of the size of the network, and is, therefore, a small constant for any relevant network. Our decentralized reinforcement learning algorithm achieves approximately a factor of two cost improvement over the base policy for a range of radii bounded from below and above by two and three times the critical sensing radius, respectively.
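The claim that a quantity growing with $\log^*$ of the network size is "a small constant for any relevant network" can be checked numerically. The sketch below is only an illustration of the iterated logarithm, not code from the paper: the function name `log_star` is hypothetical, and base 2 is an assumption, since the abstract does not state a base.

```python
import math


def log_star(n: float) -> int:
    """Iterated logarithm (base 2, assumed): how many times log2 must be
    applied to n before the result drops to 1 or below."""
    count = 0
    while n > 1:
        n = math.log2(n)  # accepts arbitrarily large Python ints
        count += 1
    return count


if __name__ == "__main__":
    # Network sizes spanning many orders of magnitude.
    for size in (2, 16, 65536, 10**9, 10**80):
        print(f"n = {size:.3g}: log*(n) = {log_star(size)}")
```

Even for a network with $10^{80}$ nodes the value stays in the single digits, which is the sense in which the critical sensing radius is effectively constant.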

Subjects:	Distributed, Parallel, and Cluster Computing (cs.DC); Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA)
Cite as:	arXiv:2305.15596 [cs.DC]
	(or arXiv:2305.15596v3 [cs.DC] for this version)
	https://doi.org/10.48550/arXiv.2305.15596

Submission history

From: Jamison Weber
[v1] Wed, 24 May 2023 22:06:44 UTC (4,278 KB)
[v2] Mon, 12 Feb 2024 18:03:29 UTC (4,112 KB)
[v3] Sat, 24 Feb 2024 01:57:58 UTC (3,862 KB)
