Computer Science > Robotics
[Submitted on 31 Jan 2018 (v1), revised 16 Mar 2018 (this version, v2), latest version 16 Oct 2018 (v3)]
Title: Model-Free Error Detection and Recovery for Robot Learning from Demonstration
Abstract: Learning from human demonstrations can facilitate automation but is risky because executing the learned policy might lead to collisions and other failures. Adding explicit constraints to avoid unsafe states is generally not possible when the state representations are complex. Furthermore, enforcing these constraints during execution of the learned policy can be challenging in environments where the dynamics are difficult to model, such as push mechanics in grasping. In this paper, we propose a two-phase method for generating robust policies from demonstrations in robotic manipulation tasks. In the first phase, we use support estimation of supervisor demonstrations and treat the support as implicit constraints on states, in addition to learning a policy directly from the observed controls. We also propose a time-variant modification to the support estimation problem, allowing for accurate estimation on sequential tasks. In the second phase, we use a switching policy to keep the robot from leaving safe regions of the state space at run time, using the decision function of the estimated support. The policy switches between the robot's learned policy and a novel recovery policy depending on the distance to the boundary of the support. We present additional conditions, which linearly bound the difference in state at each time step by the magnitude of control, allowing us to prove that the robot will not violate the constraints using the recovery policy. A simulated pushing task suggests that support estimation and recovery control can reduce collisions by 83%. On a physical line tracking task using a da Vinci Surgical Robot, recovery control reduced collisions by 84%.
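The two phases described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the support estimator here is a thresholded Gaussian kernel density (a stand-in chosen so the example needs only the standard library), the recovery control is a finite-difference ascent step on the decision function, and the names `make_decision_function`, `recovery_control`, `switching_policy`, `bandwidth`, `quantile`, and `margin` are all hypothetical. The value of the decision function is used as a rough proxy for distance to the support boundary, and the time-variant modification is omitted.

```python
import math


def make_decision_function(demos, bandwidth=0.5, quantile=0.05):
    """Phase 1 (assumed form): return g(x), positive inside the estimated
    support of the demonstrated states and negative outside it."""

    def density(x):
        # Mean Gaussian kernel response between x and every demonstrated state.
        return sum(
            math.exp(-sum((a - b) ** 2 for a, b in zip(x, d)) / (2 * bandwidth ** 2))
            for d in demos
        ) / len(demos)

    # Pick the threshold so that roughly `quantile` of the demos score below it.
    scores = sorted(density(d) for d in demos)
    threshold = scores[int(quantile * len(scores))]
    return lambda x: density(x) - threshold


def recovery_control(g, x, step=0.05, eps=1e-3):
    """Assumed recovery rule: a control of magnitude `step` along the
    finite-difference gradient of g, i.e. back toward the support."""
    grad = []
    for i in range(len(x)):
        hi = list(x)
        lo = list(x)
        hi[i] += eps
        lo[i] -= eps
        grad.append((g(hi) - g(lo)) / (2 * eps))
    norm = math.sqrt(sum(c * c for c in grad)) or 1.0  # avoid division by zero
    return [step * c / norm for c in grad]


def switching_policy(learned_policy, g, x, margin=0.1):
    """Phase 2: use the learned policy well inside the support,
    otherwise switch to the recovery control."""
    if g(x) > margin:
        return learned_policy(x)
    return recovery_control(g, x)


# Toy usage: demonstrations lie on the segment from (0, 0) to (1, 0).
demos = [(i / 20, 0.0) for i in range(21)]
g = make_decision_function(demos)
learned = lambda x: [0.1, 0.0]  # placeholder for a policy learned from controls

u_inside = switching_policy(learned, g, (0.5, 0.0))   # learned policy is used
u_outside = switching_policy(learned, g, (0.5, 1.0))  # recovery control is used
```

In this toy setting the state (0.5, 1.0) lies away from the demonstrated segment, so the switching rule returns a small control pointing back toward it. Bounding the recovery step by `step` loosely mirrors the abstract's condition that the change in state is bounded by the magnitude of control, but the sketch carries no safety guarantee.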
Submission history
From: Jonathan Lee
[v1] Wed, 31 Jan 2018 06:57:58 UTC (6,309 KB)
[v2] Fri, 16 Mar 2018 22:48:47 UTC (2,395 KB)
[v3] Tue, 16 Oct 2018 09:42:57 UTC (5,058 KB)