Asynchronous Distributed Reinforcement Learning for LQR Control via Zeroth-Order Block Coordinate Descent

Jing, Gangshan; Bai, He; George, Jemin; Chakrabortty, Aranya; Sharma, Piyush K.

doi:10.1109/TAC.2024.3386061

arXiv:2107.12416 (eess)

[Submitted on 26 Jul 2021 (v1), last revised 3 May 2024 (this version, v4)]

Abstract: Recently introduced distributed zeroth-order optimization (ZOO) algorithms have shown their utility in distributed reinforcement learning (RL). Unfortunately, in the gradient estimation process, almost all of them require random samples with the same dimension as the global variable and/or require evaluation of the global cost function, which may induce high estimation variance for large-scale networks. In this paper, we propose a novel distributed zeroth-order algorithm that leverages the network structure inherent in the optimization objective, allowing each agent to estimate its local gradient independently from local cost evaluations, without using any consensus protocol. The proposed algorithm, which is based on the block coordinate descent method, uses an asynchronous update scheme and is designed for stochastic non-convex optimization over a possibly non-convex feasible domain. The algorithm is then employed as a distributed model-free RL algorithm for distributed linear quadratic regulator (LQR) design, where a learning graph is designed to describe the interactions required among agents during distributed learning. We provide an empirical validation of the proposed algorithm, benchmarking its convergence rate and variance against a centralized ZOO algorithm.
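
The idea of estimating a block gradient from local cost evaluations alone can be illustrated with a small sketch. This is not the authors' algorithm: it assumes a toy quadratic cost on a three-agent chain, a two-point (central-difference) estimator, and a random choice of which agent updates next as a stand-in for asynchrony. All names and values (`local_cost`, `zo_block_step`, `run`, `targets`, `neighbors`, `radius`, `step`) are hypothetical.

```python
import math
import random


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length lists."""
    return sum((p - q) ** 2 for p, q in zip(a, b))


def local_cost(i, x, targets, neighbors):
    """Terms of the global cost that involve block i (assumed toy cost)."""
    coupling = sum(0.5 * sq_dist(x[i], x[j]) for j in neighbors[i])
    return sq_dist(x[i], targets[i]) + coupling


def global_cost(x, targets, neighbors):
    """Global cost; used only to monitor progress, never by an agent."""
    own = sum(sq_dist(x[i], targets[i]) for i in range(len(x)))
    coupling = sum(
        0.5 * sq_dist(x[i], x[j])
        for i in range(len(x)) for j in neighbors[i] if i < j
    )
    return own + coupling


def unit_direction(dim, rng):
    """Random direction on the unit sphere of the block's own dimension."""
    v = [rng.gauss(0.0, 1.0) for _ in range(dim)]
    norm = math.sqrt(sum(c * c for c in v))
    return [c / norm for c in v]


def zo_block_step(i, x, targets, neighbors, rng, radius=1e-3, step=0.05):
    """Update block i in place using two local cost evaluations only."""
    dim = len(x[i])
    u = unit_direction(dim, rng)
    original = x[i]
    x[i] = [a + radius * c for a, c in zip(original, u)]
    f_plus = local_cost(i, x, targets, neighbors)
    x[i] = [a - radius * c for a, c in zip(original, u)]
    f_minus = local_cost(i, x, targets, neighbors)
    # Two-point estimate of the block gradient: dim * slope along u, times u.
    scale = dim * (f_plus - f_minus) / (2.0 * radius)
    x[i] = [a - step * scale * c for a, c in zip(original, u)]


def run(num_updates=4000, seed=0):
    rng = random.Random(seed)
    targets = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]]  # assumed toy data
    neighbors = {0: [1], 1: [0, 2], 2: [1]}          # chain graph 0-1-2
    x = [[0.0, 0.0] for _ in targets]
    start = global_cost(x, targets, neighbors)
    for _ in range(num_updates):
        i = rng.randrange(len(x))  # one agent updates at a time
        zo_block_step(i, x, targets, neighbors, rng)
    return start, global_cost(x, targets, neighbors), x


if __name__ == "__main__":
    start, final, x = run()
    print("global cost before:", start, "after:", final)
```

In this sketch the random direction has the dimension of one agent's block rather than of the global variable, and each update reads only the cost terms that involve that block, which is the property the abstract highlights. The estimator actually used in the paper, its noise model, and the learning graph for the LQR setting may differ from what is assumed here.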

Comments:	The arXiv version contains proofs of Lemma 3 and Lemma 5, which are missing from the published version
Subjects:	Systems and Control (eess.SY); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Optimization and Control (math.OC)
Cite as:	arXiv:2107.12416 [eess.SY]
	(or arXiv:2107.12416v4 [eess.SY] for this version)
	https://doi.org/10.48550/arXiv.2107.12416
Related DOI:	https://doi.org/10.1109/TAC.2024.3386061

Submission history

From: Gangshan Jing
[v1] Mon, 26 Jul 2021 18:11:07 UTC (4,386 KB)
[v2] Wed, 28 Jul 2021 14:42:19 UTC (4,387 KB)
[v3] Tue, 25 Oct 2022 11:22:06 UTC (4,558 KB)
[v4] Fri, 3 May 2024 03:56:09 UTC (4,649 KB)
