Scalable Planning in Multi-Agent MDPs

Sahabandu, Dinuka; Niu, Luyao; Clark, Andrew; Poovendran, Radha

Computer Science > Multiagent Systems

arXiv:2103.15894 (cs)

[Submitted on 29 Mar 2021]

Title:Scalable Planning in Multi-Agent MDPs

Authors:Dinuka Sahabandu, Luyao Niu, Andrew Clark, Radha Poovendran

View PDF

Abstract:Multi-agent Markov Decision Processes (MMDPs) arise in a variety of applications including target tracking, control of multi-robot swarms, and multiplayer games. A key challenge in MMDPs occurs when the state and action spaces grow exponentially in the number of agents, making computation of an optimal policy computationally intractable for medium- to large-scale problems. One property that has been exploited to mitigate this complexity is transition independence, in which each agent's transition probabilities are independent of the states and actions of other agents. Transition independence enables factorization of the MMDP and computation of local agent policies but does not hold for arbitrary MMDPs. In this paper, we propose an approximate transition dependence property, called $\delta$-transition dependence and develop a metric for quantifying how far an MMDP deviates from transition independence. Our definition of $\delta$-transition dependence recovers transition independence as a special case when $\delta$ is zero. We develop a polynomial time algorithm in the number of agents that achieves a provable bound on the global optimum when the reward functions are monotone increasing and submodular in the agent actions. We evaluate our approach on two case studies, namely, multi-robot control and multi-agent patrolling example.

Subjects:	Multiagent Systems (cs.MA)
Cite as:	arXiv:2103.15894 [cs.MA]
	(or arXiv:2103.15894v1 [cs.MA] for this version)
	https://doi.org/10.48550/arXiv.2103.15894

Submission history

From: Dinuka Sahabandu [view email]
[v1] Mon, 29 Mar 2021 19:04:39 UTC (4,504 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.MA

< prev | next >

new | recent | 2021-03

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Dinuka Sahabandu
Luyao Niu
Andrew Clark
Radha Poovendran

export BibTeX citation

Computer Science > Multiagent Systems

Title:Scalable Planning in Multi-Agent MDPs

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Multiagent Systems

Title:Scalable Planning in Multi-Agent MDPs

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators