Safe Reinforcement Learning for Autonomous Vehicles through Parallel Constrained Policy Optimization

Wen, Lu; Duan, Jingliang; Li, Shengbo Eben; Xu, Shaobing; Peng, Huei

Computer Science > Machine Learning

arXiv:2003.01303 (cs)

[Submitted on 3 Mar 2020]

Title:Safe Reinforcement Learning for Autonomous Vehicles through Parallel Constrained Policy Optimization

Authors:Lu Wen, Jingliang Duan, Shengbo Eben Li, Shaobing Xu, Huei Peng

View PDF

Abstract:Reinforcement learning (RL) is attracting increasing interests in autonomous driving due to its potential to solve complex classification and control problems. However, existing RL algorithms are rarely applied to real vehicles for two predominant problems: behaviours are unexplainable, and they cannot guarantee safety under new scenarios. This paper presents a safe RL algorithm, called Parallel Constrained Policy Optimization (PCPO), for two autonomous driving tasks. PCPO extends today's common actor-critic architecture to a three-component learning framework, in which three neural networks are used to approximate the policy function, value function and a newly added risk function, respectively. Meanwhile, a trust region constraint is added to allow large update steps without breaking the monotonic improvement condition. To ensure the feasibility of safety constrained problems, synchronized parallel learners are employed to explore different state spaces, which accelerates learning and policy-update. The simulations of two scenarios for autonomous vehicles confirm we can ensure safety while achieving fast learning.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as:	arXiv:2003.01303 [cs.LG]
	(or arXiv:2003.01303v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2003.01303

Submission history

From: Lu Wen [view email]
[v1] Tue, 3 Mar 2020 02:53:30 UTC (2,117 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2020-03

Change to browse by:

cs
cs.AI
stat
stat.ML

References & Citations

DBLP - CS Bibliography

listing | bibtex

Shengbo Eben Li
Huei Peng

export BibTeX citation

Computer Science > Machine Learning

Title:Safe Reinforcement Learning for Autonomous Vehicles through Parallel Constrained Policy Optimization

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Safe Reinforcement Learning for Autonomous Vehicles through Parallel Constrained Policy Optimization

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators