Data-Based Efficient Off-Policy Stabilizing Optimal Control Algorithms for Discrete-Time Linear Systems via Damping Coefficients

Li, Dongdong; Dong, Jiuxiang

Electrical Engineering and Systems Science > Systems and Control

arXiv:2412.20845 (eess)

[Submitted on 30 Dec 2024 (v1), last revised 19 Mar 2025 (this version, v2)]

Title:Data-Based Efficient Off-Policy Stabilizing Optimal Control Algorithms for Discrete-Time Linear Systems via Damping Coefficients

Authors:Dongdong Li, Jiuxiang Dong

View PDF HTML (experimental)

Abstract:Policy iteration is one of the classical frameworks of reinforcement learning, which requires a known initial stabilizing control. However, finding the initial stabilizing control depends on the known system model. To relax this requirement and achieve model-free optimal control, in this paper, two different reinforcement learning algorithms based on policy iteration and variable damping coefficients are designed for unknown discrete-time linear systems. First, a stable artificial system is designed, and this system is gradually iterated to the original system by varying the damping coefficients. This allows the initial stabilizing control to be obtained in a finite number of iteration steps. Then, an off-policy iteration algorithm and an off-policy $\mathcal{Q}$-learning algorithm are designed to select the appropriate damping coefficients and realize data-driven. In these two algorithms, the current estimates of optimal control gain are not applied to the system to re-collect data. Moreover, they are characterized by the fast convergence of the traditional policy iteration. Finally, the proposed algorithms are validated by simulation.

Subjects:	Systems and Control (eess.SY)
Cite as:	arXiv:2412.20845 [eess.SY]
	(or arXiv:2412.20845v2 [eess.SY] for this version)
	https://doi.org/10.48550/arXiv.2412.20845

Submission history

From: Dongdong Li [view email]
[v1] Mon, 30 Dec 2024 10:28:01 UTC (2,764 KB)
[v2] Wed, 19 Mar 2025 09:31:38 UTC (306 KB)

Electrical Engineering and Systems Science > Systems and Control

Title:Data-Based Efficient Off-Policy Stabilizing Optimal Control Algorithms for Discrete-Time Linear Systems via Damping Coefficients

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Systems and Control

Title:Data-Based Efficient Off-Policy Stabilizing Optimal Control Algorithms for Discrete-Time Linear Systems via Damping Coefficients

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators