Policy Evaluation in Decentralized POMDPs with Belief Sharing

Kayaalp, Mert; Ghadieh, Fatima; Sayed, Ali H.

Computer Science > Machine Learning

arXiv:2302.04151 (cs)

[Submitted on 8 Feb 2023 (v1), last revised 16 May 2023 (this version, v2)]


Abstract: Most works on multi-agent reinforcement learning focus on scenarios where the state of the environment is fully observable. In this work, we consider a cooperative policy evaluation task in which agents are not assumed to observe the environment state directly. Instead, agents have access only to noisy observations and to belief vectors. It is well known that finding global posterior distributions in multi-agent settings is generally NP-hard. As a remedy, we propose a fully decentralized belief-forming strategy that relies on individual updates and on localized interactions over a communication network. In addition to exchanging beliefs, agents also use the communication network to exchange value function parameter estimates. We show analytically that the proposed strategy allows information to diffuse over the network, which in turn keeps the difference between the agents' parameters and those of a centralized baseline bounded. A multi-sensor target tracking application is considered in the simulations.
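The abstract does not spell out the update rules, so the following Python sketch is only an illustration of what "individual updates" followed by "localized interactions" could look like. It assumes a finite state space, a local Bayesian update per agent, and a weighted geometric average over neighbors as the combination step (one common choice in diffusion-style strategies, not confirmed by the abstract as the authors' rule). The function names, the combination matrix `weights`, and the plain averaging of parameter estimates are all hypothetical.

```python
import numpy as np


def local_bayes_update(belief, transition, likelihood):
    """Individual update for one agent (assumed form).

    belief:     prior over states, shape (S,)
    transition: transition[s, s2] = P(next state s2 | state s), shape (S, S)
    likelihood: P(agent's observation | state), shape (S,), strictly positive
    """
    predicted = transition.T @ belief      # propagate the belief one time step
    posterior = predicted * likelihood     # Bayes rule, unnormalized
    return posterior / posterior.sum()


def combine_beliefs(intermediate, weights):
    """Localized interaction (assumed form): geometric averaging of beliefs.

    intermediate: intermediate beliefs, one row per agent, shape (K, S)
    weights:      weights[l, k] is the weight agent k gives to agent l;
                  each column sums to 1 and is zero for non-neighbors
    """
    mixed = weights.T @ np.log(intermediate)    # weighted sum of log-beliefs
    mixed -= mixed.max(axis=1, keepdims=True)   # shift for numerical stability
    combined = np.exp(mixed)
    return combined / combined.sum(axis=1, keepdims=True)


def combine_parameters(params, weights):
    """Assumed parameter exchange: weighted average of neighbors' estimates."""
    return weights.T @ params


# Hypothetical example: 3 agents, 2 states, fully connected uniform weights.
weights = np.full((3, 3), 1.0 / 3.0)
transition = np.array([[0.9, 0.1],
                       [0.2, 0.8]])
prior = np.array([0.5, 0.5])
likelihoods = np.array([[0.7, 0.3],    # each row: one agent's noisy observation
                        [0.6, 0.4],
                        [0.2, 0.8]])

intermediate = np.vstack(
    [local_bayes_update(prior, transition, lik) for lik in likelihoods]
)
beliefs = combine_beliefs(intermediate, weights)
print(beliefs)
```

With uniform weights over a fully connected network, every agent ends the step with the same combined belief; with a sparser `weights` matrix, agreement would only emerge gradually as information spreads over repeated steps.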

Comments:	Accepted for publication in IEEE Open Journal of Control Systems, Special Section: Intersection of Machine Learning with Control
Subjects:	Machine Learning (cs.LG); Multiagent Systems (cs.MA); Signal Processing (eess.SP); Systems and Control (eess.SY)
Cite as:	arXiv:2302.04151 [cs.LG]
	(or arXiv:2302.04151v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2302.04151

Submission history

From: Mert Kayaalp
[v1] Wed, 8 Feb 2023 15:54:15 UTC (1,363 KB)
[v2] Tue, 16 May 2023 11:43:37 UTC (1,422 KB)
