On the Robustness of Safe Reinforcement Learning under Observational Perturbations

Liu, Zuxin; Guo, Zijian; Cen, Zhepeng; Zhang, Huan; Tan, Jie; Li, Bo; Zhao, Ding

arXiv:2205.14691 (cs)

[Submitted on 29 May 2022 (v1), last revised 2 Mar 2023 (this version, v3)]

Abstract: Safe reinforcement learning (RL) trains a policy to maximize the task reward while satisfying safety constraints. While prior work focuses on performance optimality, we find that the optimal solutions of many safe RL problems are neither robust nor safe against carefully designed observational perturbations. We formally analyze the unique properties of designing effective observational adversarial attackers in the safe RL setting. We show that baseline adversarial attack techniques for standard RL tasks are not always effective for safe RL and propose two new approaches: one maximizes the cost and the other maximizes the reward. One interesting and counter-intuitive finding is that the maximum reward attack is strong, as it can both induce unsafe behaviors and remain stealthy by maintaining the reward. We further propose a robust training framework for safe RL and evaluate it via comprehensive experiments. This paper provides pioneering work investigating the safety and robustness of RL under observational attacks for future safe RL studies. Code is available at: this https URL
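
To make the idea of a cost-maximizing or reward-maximizing observational attacker concrete, the following is a minimal sketch and not the paper's implementation. It assumes an l-infinity perturbation budget `epsilon`, a random-search attacker, and that the value estimate is evaluated at the true observation with the action chosen from the perturbed one. The names `value_maximizing_attack`, `toy_policy`, `toy_cost_q`, and `toy_reward_q` are hypothetical and do not come from the source.

```python
import numpy as np

def value_maximizing_attack(obs, policy, q_fn, epsilon, n_samples=256, seed=0):
    """Random-search sketch of an observational attacker.

    Searches an l-infinity ball of radius epsilon around obs for the perturbed
    observation whose induced action maximizes q_fn(obs, action).
    Passing a cost estimate gives a cost-maximizing attacker; passing a
    reward estimate gives a reward-maximizing attacker.
    """
    rng = np.random.default_rng(seed)
    obs = np.asarray(obs, dtype=float)
    deltas = rng.uniform(-epsilon, epsilon, size=(n_samples, obs.shape[0]))
    deltas[0] = 0.0  # include the unperturbed observation as a candidate
    best_obs, best_val = obs, -np.inf
    for delta in deltas:
        candidate = obs + delta
        # The agent acts on the perturbed observation, but the value is
        # scored against the true observation (an assumption of this sketch).
        val = q_fn(obs, policy(candidate))
        if val > best_val:
            best_obs, best_val = candidate, val
    return best_obs, best_val

# Hypothetical toy stand-ins for a trained policy and its value estimates.
def toy_policy(o):
    return float(np.tanh(o.sum()))

def toy_cost_q(s, a):
    return a  # larger action means higher estimated cost

def toy_reward_q(s, a):
    return -(a - 0.5) ** 2  # estimated reward peaks at a = 0.5

obs = np.array([0.1, -0.2, 0.05])
eps = 0.1
adv_cost_obs, adv_cost_val = value_maximizing_attack(obs, toy_policy, toy_cost_q, eps)
adv_rew_obs, adv_rew_val = value_maximizing_attack(obs, toy_policy, toy_reward_q, eps)
```

Because the unperturbed observation is always among the candidates, the returned value is never lower than the clean one. A gradient-based search could replace the random search when the policy and value estimates are differentiable.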

Comments:	Published at the 11th International Conference on Learning Representations (ICLR 2023). 30 pages, 5 figures, 8 tables
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Robotics (cs.RO)
Cite as:	arXiv:2205.14691 [cs.LG]
	(or arXiv:2205.14691v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2205.14691

Submission history

From: Zuxin Liu
[v1] Sun, 29 May 2022 15:25:03 UTC (1,577 KB)
[v2] Mon, 3 Oct 2022 05:08:11 UTC (5,165 KB)
[v3] Thu, 2 Mar 2023 02:56:47 UTC (3,904 KB)
