Model-Free Learning for the Linear Quadratic Regulator over Rate-Limited Channels

Ye, Lintao; Mitra, Aritra; Gupta, Vijay

Mathematics > Optimization and Control

arXiv:2401.01258 (math)

[Submitted on 2 Jan 2024 (v1), last revised 19 Sep 2024 (this version, v3)]

Title:Model-Free Learning for the Linear Quadratic Regulator over Rate-Limited Channels

Authors:Lintao Ye, Aritra Mitra, Vijay Gupta

View PDF HTML (experimental)

Abstract:Consider a linear quadratic regulator (LQR) problem being solved in a model-free manner using the policy gradient approach. If the gradient of the quadratic cost is being transmitted across a rate-limited channel, both the convergence and the rate of convergence of the resulting controller may be affected by the bit-rate permitted by the channel. We first pose this problem in a communication-constrained optimization framework and propose a new adaptive quantization algorithm titled Adaptively Quantized Gradient Descent (AQGD). This algorithm guarantees exponentially fast convergence to the globally optimal policy, with no deterioration of the exponent relative to the unquantized setting, above a certain finite threshold bit-rate allowed by the communication channel. We then propose a variant of AQGD that provides similar performance guarantees when applied to solve the model-free LQR problem. Our approach reveals the benefits of adaptive quantization in preserving fast linear convergence rates, and, as such, may be of independent interest to the literature on compressed optimization. Our work also marks a first step towards a more general bridge between the fields of model-free control design and networked control systems.

Comments:	37 pages, 3 figures. Compared to the previous version, we extend the general AQGD algorithm to handle noisy gradients and prove its convergence. In addition, we apply the algorithm to solve model-free LQR with communication constraints and provide finite sample analysis regarding the convergence of the algorithm
Subjects:	Optimization and Control (math.OC); Machine Learning (cs.LG); Systems and Control (eess.SY)
Cite as:	arXiv:2401.01258 [math.OC]
	(or arXiv:2401.01258v3 [math.OC] for this version)
	https://doi.org/10.48550/arXiv.2401.01258

Submission history

From: Lintao Ye [view email]
[v1] Tue, 2 Jan 2024 15:59:00 UTC (100 KB)
[v2] Fri, 2 Aug 2024 03:39:18 UTC (86 KB)
[v3] Thu, 19 Sep 2024 13:36:21 UTC (300 KB)

Mathematics > Optimization and Control

Title:Model-Free Learning for the Linear Quadratic Regulator over Rate-Limited Channels

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Mathematics > Optimization and Control

Title:Model-Free Learning for the Linear Quadratic Regulator over Rate-Limited Channels

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators