Reinforcement Learning with Ensemble Model Predictive Safety Certification

Gronauer, Sven; Haider, Tom; da Roza, Felippe Schmoeller; Diepold, Klaus

Computer Science > Machine Learning

arXiv:2402.04182 (cs)

[Submitted on 6 Feb 2024]

Title: Reinforcement Learning with Ensemble Model Predictive Safety Certification

Authors: Sven Gronauer, Tom Haider, Felippe Schmoeller da Roza, Klaus Diepold


Abstract: Reinforcement learning algorithms need exploration to learn. However, unsupervised exploration prevents the deployment of such algorithms on safety-critical tasks and limits real-world deployment. In this paper, we propose a new algorithm called Ensemble Model Predictive Safety Certification that combines model-based deep reinforcement learning with tube-based model predictive control to correct the actions taken by a learning agent, keeping safety constraint violations at a minimum through planning. Our approach aims to reduce the amount of prior knowledge about the actual system by requiring only offline data generated by a safe controller. Our results show that we can achieve significantly fewer constraint violations than comparable reinforcement learning methods.
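
The general idea of correcting a learning agent's actions through planning can be illustrated with a toy sketch. This is not the authors' algorithm: it assumes a one-dimensional state, a hypothetical ensemble of linear models `x_next = x + gain * a`, a hypothetical `backup_controller` standing in for the safe controller, and a fixed `tube_margin` that tightens the state bound. Where tube-based model predictive control would solve an optimization problem, this sketch only accepts or rejects the proposed action.

```python
import numpy as np


def make_ensemble(gains):
    # Hypothetical stand-in for learned dynamics models: x_next = x + gain * a.
    return [lambda x, a, g=g: x + g * a for g in gains]


def backup_controller(x):
    # Hypothetical safe controller: drive the state toward the origin.
    return float(np.clip(-x, -1.0, 1.0))


def is_certified(x0, action, ensemble, horizon, x_max, tube_margin):
    # Tightened bound: an assumed fixed margin plays the role of the tube.
    limit = x_max - tube_margin
    for model in ensemble:
        x, a = x0, action
        for _ in range(horizon):
            x = model(x, a)           # predict the next state with this member
            if abs(x) > limit:        # any member predicting a violation rejects
                return False
            a = backup_controller(x)  # after the first step, follow the backup
    return True


def certify_action(x0, proposed, ensemble, horizon=5, x_max=1.0, tube_margin=0.1):
    # Keep the agent's action if every ensemble member predicts a safe rollout,
    # otherwise fall back to the backup controller's action.
    if is_certified(x0, proposed, ensemble, horizon, x_max, tube_margin):
        return proposed
    return backup_controller(x0)


if __name__ == "__main__":
    ensemble = make_ensemble([0.08, 0.10, 0.12])
    print(certify_action(0.0, 0.5, ensemble))   # far from the bound
    print(certify_action(0.85, 1.0, ensemble))  # close to the bound
```

In this sketch, disagreement between ensemble members acts as a crude proxy for model uncertainty: a proposed action is kept only if every member predicts a rollout that stays inside the tightened bound.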

Comments:	Published in: Proc. of the 23rd International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2024)
Subjects:	Machine Learning (cs.LG); Robotics (cs.RO)
Cite as:	arXiv:2402.04182 [cs.LG]
	(or arXiv:2402.04182v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2402.04182

Submission history

From: Sven Gronauer
[v1] Tue, 6 Feb 2024 17:42:39 UTC (1,784 KB)
