The Gradient Convergence Bound of Federated Multi-Agent Reinforcement Learning with Efficient Communication

Xu, Xing; Li, Rongpeng; Zhao, Zhifeng; Zhang, Honggang

arXiv:2103.13026 (cs)

[Submitted on 24 Mar 2021 (v1), last revised 29 May 2023 (this version, v2)]

Abstract: This paper considers independent reinforcement learning (IRL) for multi-agent collaborative decision-making within the federated learning (FL) paradigm. However, FL incurs excessive communication overhead between the agents and a remote central server, especially when it involves a large number of agents or iterations. Moreover, because the independent learning environments are heterogeneous, the agents may undergo asynchronous Markov decision processes (MDPs), which affects the training samples and the model's convergence performance. Building on the variation-aware periodic averaging (VPA) method and a policy-based deep reinforcement learning (DRL) algorithm, namely proximal policy optimization (PPO), this paper proposes two advanced optimization schemes oriented toward stochastic gradient descent (SGD): 1) a decay-based scheme that gradually decays the weights of a model's local gradients as successive local updates progress, and 2) a consensus-based scheme that represents the agents as a graph and studies the impact of exchanging a model's local gradients among nearby agents from an algebraic connectivity perspective. The paper also provides novel convergence guarantees for both schemes and demonstrates, through theoretical analysis and simulation results, their superior effectiveness and efficiency in improving the system's utility value.
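The two schemes named in the abstract can be illustrated with a minimal sketch on a toy problem. Everything below is an assumption made for illustration and is not taken from the paper: the exponential weight `decay ** k`, the consensus step size `eps`, the scalar quadratic losses, and the function names `decayed_local_updates`, `consensus_mix`, `periodic_average`, and `federated_round` are all hypothetical. The actual method builds on PPO and VPA, which this sketch does not implement.

```python
from typing import Callable, Dict, List


def decayed_local_updates(theta: float, grad_fn: Callable[[float], float],
                          steps: int, lr: float, decay: float) -> float:
    """Run `steps` local gradient steps; the k-th gradient is weighted by decay**k.

    The exponential form decay**k is an assumed weighting, not the paper's.
    """
    for k in range(steps):
        theta -= lr * (decay ** k) * grad_fn(theta)
    return theta


def consensus_mix(grads: List[float], neighbors: Dict[int, List[int]],
                  eps: float) -> List[float]:
    """One consensus step: each agent moves its gradient toward its neighbors'."""
    return [g_i + eps * sum(grads[j] - g_i for j in neighbors[i])
            for i, g_i in enumerate(grads)]


def periodic_average(thetas: List[float]) -> float:
    """Server-side averaging of the agents' local parameters."""
    return sum(thetas) / len(thetas)


def federated_round(theta_global: float, targets: List[float],
                    steps: int, lr: float, decay: float) -> float:
    """One communication round on toy losses 0.5 * (theta - c_i)**2."""
    local = [
        # The gradient of 0.5 * (t - c)**2 with respect to t is (t - c).
        decayed_local_updates(theta_global, lambda t, c=c: t - c, steps, lr, decay)
        for c in targets
    ]
    return periodic_average(local)
```

With symmetric neighbor lists (an undirected graph), `consensus_mix` leaves the sum of the agents' gradients unchanged, so repeated mixing pulls the gradients toward their common mean. For consensus updates of this form, the speed of that mixing is tied to the algebraic connectivity of the graph, that is, the second-smallest eigenvalue of its Laplacian, which is the quantity the abstract refers to.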

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA)
Cite as: arXiv:2103.13026 [cs.LG]
(or arXiv:2103.13026v2 [cs.LG] for this version)
https://doi.org/10.48550/arXiv.2103.13026

Submission history

From: Xing Xu
[v1] Wed, 24 Mar 2021 07:21:43 UTC (828 KB)
[v2] Mon, 29 May 2023 12:53:01 UTC (2,328 KB)
