Provably Stabilizing Model-Free Q-Learning for Unknown Bilinear Systems

Clarke, Shanelle G.; Thapliyal, Omanshu; Hwang, Inseok

Electrical Engineering and Systems Science > Systems and Control

arXiv:2208.13843 (eess)

[Submitted on 29 Aug 2022]

Title:Provably Stabilizing Model-Free Q-Learning for Unknown Bilinear Systems

Authors:Shanelle G. Clarke, Omanshu Thapliyal, Inseok Hwang

View PDF

Abstract:In this paper, we present a provably convergent Model-Free ${Q}$-Learning algorithm that learns a stabilizing control policy for an unknown Bilinear System from a single online run. Given an unknown bilinear system, we study the interplay between its equivalent control-affine linear time-varying and linear time-invariant representations to derive i) from Pontryagin's Minimum Principle, a pair of point-to-point model-free policy improvement and evaluation laws that iteratively solves for an optimal state-dependent control policy; and ii) the properties under which the state-input data is sufficient to characterize system behavior in a model-free manner. We demonstrate the performance of the proposed algorithm via illustrative numerical examples and compare it to the model-based case.

Comments:	7 pages, 1 figure, Submitted to IEEE Control Systems Letters (L-CSS)
Subjects:	Systems and Control (eess.SY); Optimization and Control (math.OC)
Cite as:	arXiv:2208.13843 [eess.SY]
	(or arXiv:2208.13843v1 [eess.SY] for this version)
	https://doi.org/10.48550/arXiv.2208.13843

Submission history

From: Shanelle Clarke [view email]
[v1] Mon, 29 Aug 2022 19:17:03 UTC (271 KB)

Full-text links:

Access Paper:

view license

Current browse context:

eess.SY

< prev | next >

new | recent | 2022-08

Change to browse by:

cs
cs.SY
eess
math
math.OC

References & Citations

export BibTeX citation

✅2024-10-01: arxiv.org is back to normal.✅

Electrical Engineering and Systems Science > Systems and Control

Title:Provably Stabilizing Model-Free Q-Learning for Unknown Bilinear Systems

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

✅2024-10-01: arxiv.org is back to normal.✅

Electrical Engineering and Systems Science > Systems and Control

Title:Provably Stabilizing Model-Free Q-Learning for Unknown Bilinear Systems

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators