Reinforcement Learning based Control of Imitative Policies for Near-Accident Driving

Cao, Zhangjie; Bıyık, Erdem; Wang, Woodrow Z.; Raventos, Allan; Gaidon, Adrien; Rosman, Guy; Sadigh, Dorsa

arXiv:2007.00178 (cs)

[Submitted on 1 Jul 2020]

Abstract: Autonomous driving has achieved significant progress in recent years, but autonomous cars are still unable to tackle high-risk situations where a potential accident is likely. In such near-accident scenarios, even a minor change in the vehicle's actions may result in drastically different consequences. To avoid unsafe actions in near-accident scenarios, we need to fully explore the environment. However, reinforcement learning (RL) and imitation learning (IL), two widely used policy learning methods, cannot model rapid phase transitions and are not scalable to fully cover all the states. To address driving in near-accident scenarios, we propose a hierarchical reinforcement and imitation learning (H-ReIL) approach that consists of low-level policies learned by IL for discrete driving modes, and a high-level policy learned by RL that switches between different driving modes. Our approach exploits the advantages of both IL and RL by integrating them into a unified learning framework. Experimental results and user studies suggest our approach can achieve higher efficiency and safety than other methods. Analyses of the policies demonstrate that our high-level policy appropriately switches between different low-level policies in near-accident driving situations.
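The two-level structure described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the observation layout, the action format, the mode names (`cautious_policy`, `assertive_policy`), the fixed gap threshold, and the `switch_every` parameter are all hypothetical placeholders. In H-ReIL the low-level policies would be learned by IL and the mode selector by RL; here both are hand-written stand-ins so that only the control flow is shown.

```python
from typing import Callable, List, Sequence, Tuple

# Hypothetical layouts, chosen only for this illustration.
Observation = Sequence[float]   # [gap_to_other_vehicle_m, ego_speed_mps]
Action = Tuple[float, float]    # (steering, acceleration)


def cautious_policy(obs: Observation) -> Action:
    """Stand-in for an IL-trained low-level policy: slow down."""
    return (0.0, -1.0)


def assertive_policy(obs: Observation) -> Action:
    """Stand-in for an IL-trained low-level policy: speed up."""
    return (0.0, 1.0)


def threshold_mode_selector(obs: Observation) -> int:
    """Stand-in for the RL-trained high-level policy.

    Returns the index of the driving mode to run. A fixed gap
    threshold replaces what would be a learned decision rule.
    """
    gap_m = obs[0]
    return 0 if gap_m < 10.0 else 1


class HierarchicalController:
    """Runs one low-level policy at a time, chosen by a high-level policy."""

    def __init__(
        self,
        low_level: List[Callable[[Observation], Action]],
        high_level: Callable[[Observation], int],
        switch_every: int = 1,  # assumption: re-select the mode every N steps
    ) -> None:
        if switch_every < 1:
            raise ValueError("switch_every must be at least 1")
        self.low_level = low_level
        self.high_level = high_level
        self.switch_every = switch_every
        self.mode = 0
        self._step = 0

    def act(self, obs: Observation) -> Action:
        # The high-level policy only picks a mode; it never outputs controls.
        if self._step % self.switch_every == 0:
            self.mode = self.high_level(obs)
        self._step += 1
        # The active low-level policy maps the observation to a control action.
        return self.low_level[self.mode](obs)


controller = HierarchicalController(
    [cautious_policy, assertive_policy], threshold_mode_selector, switch_every=2
)
first_action = controller.act([50.0, 5.0])  # large gap: assertive mode is selected
```

Separating the discrete mode choice from continuous control means the high-level learner only has to explore a small action space (which mode to run), which is one way to read the abstract's claim about combining the strengths of IL and RL.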

Comments:	10 pages, 7 figures. Published at Robotics: Science and Systems (RSS) 2020
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Robotics (cs.RO); Systems and Control (eess.SY); Machine Learning (stat.ML)
Cite as:	arXiv:2007.00178 [cs.LG]
	(or arXiv:2007.00178v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2007.00178

Submission history

From: Erdem Bıyık
[v1] Wed, 1 Jul 2020 01:41:45 UTC (7,757 KB)
