Decentralized Federated Policy Gradient with Byzantine Fault-Tolerance and Provably Fast Convergence

Jordan, Philip; Grötschla, Florian; Fan, Flint Xiaofeng; Wattenhofer, Roger

arXiv:2401.03489 (cs)

[Submitted on 7 Jan 2024]

Abstract: In Federated Reinforcement Learning (FRL), agents aim to collaboratively learn a common task, while each agent is acting in its local environment without exchanging raw trajectories. Existing approaches for FRL either (a) do not provide any fault-tolerance guarantees (against misbehaving agents), or (b) rely on a trusted central agent (a single point of failure) for aggregating updates. We provide the first decentralized Byzantine fault-tolerant FRL method. Towards this end, we first propose a new centralized Byzantine fault-tolerant policy gradient (PG) algorithm that improves over existing methods by relying only on assumptions standard for non-fault-tolerant PG. Then, as our main contribution, we show how a combination of robust aggregation and Byzantine-resilient agreement methods can be leveraged in order to eliminate the need for a trusted central entity. Since our results represent the first sample complexity analysis for Byzantine fault-tolerant decentralized federated non-convex optimization, our technical contributions may be of independent interest. Finally, we corroborate our theoretical results experimentally for common RL environments, demonstrating the speed-up of decentralized federations w.r.t. the number of participating agents and resilience against various Byzantine attacks.
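
The abstract does not say which robust aggregation rule the method uses, so the following is only a generic illustration of the idea, not the paper's algorithm. It sketches one well-known rule, the coordinate-wise trimmed mean, applied to gradient estimates reported by agents. The function name `trimmed_mean_aggregate`, the parameter `f` (an assumed upper bound on the number of Byzantine agents), and the sample values are all hypothetical.

```python
from typing import List, Sequence


def trimmed_mean_aggregate(updates: Sequence[Sequence[float]], f: int) -> List[float]:
    """Coordinate-wise trimmed mean (illustrative, not the paper's aggregator).

    For every coordinate, sort the reported values, discard the f smallest
    and the f largest, and average what remains.
    """
    n = len(updates)
    if f < 0 or n <= 2 * f:
        raise ValueError("need more than 2*f updates to trim f values per side")
    dim = len(updates[0])
    if any(len(u) != dim for u in updates):
        raise ValueError("all updates must have the same dimension")

    aggregated = []
    for j in range(dim):
        column = sorted(u[j] for u in updates)
        kept = column[f:n - f]  # drop f extremes on each side
        aggregated.append(sum(kept) / len(kept))
    return aggregated


# Four honest gradient estimates (made-up numbers) and one extreme outlier
# standing in for a Byzantine agent.
honest = [[1.0, -2.0], [1.2, -1.8], [0.8, -2.2], [1.0, -2.0]]
byzantine = [[1e6, -1e6]]
result = trimmed_mean_aggregate(honest + byzantine, f=1)
```

With `f=1`, the outlier is discarded in both coordinates, so each entry of `result` stays within the range of the honest values. A plain average of the same five vectors would be dominated by the outlier.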

Comments:	Accepted at AAMAS'24
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC); Multiagent Systems (cs.MA)
Cite as:	arXiv:2401.03489 [cs.LG]
	(or arXiv:2401.03489v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2401.03489

Submission history

From: Philip Jordan
[v1] Sun, 7 Jan 2024 14:06:06 UTC (5,011 KB)
