Learning a Low-dimensional Representation of a Safe Region for Safe Reinforcement Learning on Dynamical Systems

Zhou, Zhehua; Oguz, Ozgur S.; Leibold, Marion; Buss, Martin

doi:10.1109/TNNLS.2021.3106818

Computer Science > Robotics

arXiv:2010.09555 (cs)

[Submitted on 19 Oct 2020 (v1), last revised 8 Sep 2021 (this version, v2)]

Title:Learning a Low-dimensional Representation of a Safe Region for Safe Reinforcement Learning on Dynamical Systems

Authors:Zhehua Zhou, Ozgur S. Oguz, Marion Leibold, Martin Buss

View PDF

Abstract:For safely applying reinforcement learning algorithms on high-dimensional nonlinear dynamical systems, a simplified system model is used to formulate a safe reinforcement learning framework. Based on the simplified system model, a low-dimensional representation of the safe region is identified and is used to provide safety estimates for learning algorithms. However, finding a satisfying simplified system model for complex dynamical systems usually requires a considerable amount of effort. To overcome this limitation, we propose in this work a general data-driven approach that is able to efficiently learn a low-dimensional representation of the safe region. Through an online adaptation method, the low-dimensional representation is updated by using the feedback data such that more accurate safety estimates are obtained. The performance of the proposed approach for identifying the low-dimensional representation of the safe region is demonstrated with a quadcopter example. The results show that, compared to previous work, a more reliable and representative low-dimensional representation of the safe region is derived, which then extends the applicability of the safe reinforcement learning framework.

Subjects:	Robotics (cs.RO); Systems and Control (eess.SY)
Cite as:	arXiv:2010.09555 [cs.RO]
	(or arXiv:2010.09555v2 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2010.09555
Related DOI:	https://doi.org/10.1109/TNNLS.2021.3106818

Submission history

From: Zhehua Zhou [view email]
[v1] Mon, 19 Oct 2020 14:32:05 UTC (7,452 KB)
[v2] Wed, 8 Sep 2021 06:02:21 UTC (5,442 KB)

Computer Science > Robotics

Title:Learning a Low-dimensional Representation of a Safe Region for Safe Reinforcement Learning on Dynamical Systems

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:Learning a Low-dimensional Representation of a Safe Region for Safe Reinforcement Learning on Dynamical Systems

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators