Data Efficient Behavior Cloning for Fine Manipulation via Continuity-based Corrective Labels

Deshpande, Abhay; Ke, Liyiming; Pfeifer, Quinn; Gupta, Abhishek; Srinivasa, Siddhartha S.

arXiv:2405.19307 (cs)

[Submitted on 29 May 2024 (v1), last revised 21 Oct 2024 (this version, v3)]

Abstract: We consider imitation learning with access only to expert demonstrations, whose real-world application is often limited by covariate shift due to compounding errors during execution. We investigate the effectiveness of the Continuity-based Corrective Labels for Imitation Learning (CCIL) framework in mitigating this issue for real-world fine manipulation tasks. CCIL generates corrective labels by learning a locally continuous dynamics model from demonstrations to guide the agent back toward expert states. Through extensive experiments on peg insertion and fine grasping, we provide the first empirical validation that CCIL can significantly improve imitation learning performance despite discontinuities present in contact-rich manipulation. We find that: (1) real-world manipulation exhibits sufficient local smoothness to apply CCIL, (2) generated corrective labels are most beneficial in low-data regimes, and (3) label filtering based on estimated dynamics model error enables performance gains. To effectively apply CCIL to robotic domains, we offer a practical instantiation of the framework and insights into design choices and hyperparameter selection. Our work demonstrates CCIL's practicality for alleviating compounding errors in imitation learning on physical robots.
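
The idea of corrective labels described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: it assumes a linear dynamics model fit by least squares (the paper learns a locally continuous model), Gaussian state perturbations, and a filter that uses the model's residual on the original transition as a stand-in for estimated model error. All function names, `noise_scale`, and `error_threshold` are hypothetical.

```python
import numpy as np

def fit_linear_dynamics(states, actions, next_states):
    """Least-squares fit of (next_state - state) ~= [state, action, 1] @ W.
    Assumption: a linear model stands in for the learned dynamics model."""
    X = np.hstack([states, actions, np.ones((len(states), 1))])
    W, *_ = np.linalg.lstsq(X, next_states - states, rcond=None)
    return W

def predict_next(W, states, actions):
    """Predict next states under the fitted model."""
    X = np.hstack([states, actions, np.ones((len(states), 1))])
    return states + X @ W

def corrective_labels(W, states, actions, next_states,
                      noise_scale, error_threshold, rng):
    """For perturbed states near each expert state, solve for the action that
    the model predicts would lead back to the expert's next state."""
    ds, da = states.shape[1], actions.shape[1]
    A, B, c = W[:ds], W[ds:ds + da], W[-1]
    perturbed = states + noise_scale * rng.standard_normal(states.shape)
    # Require: perturbed + perturbed @ A + a_new @ B + c == next_states
    target = next_states - perturbed - perturbed @ A - c
    new_actions = np.linalg.lstsq(B.T, target.T, rcond=None)[0].T
    # Hypothetical filter: drop labels where the model fits the original
    # transition poorly (a proxy for estimated dynamics model error).
    err = np.linalg.norm(predict_next(W, states, actions) - next_states, axis=1)
    keep = err <= error_threshold
    return perturbed[keep], new_actions[keep]

# Toy demonstrations with known dynamics: next_state = state + action.
rng = np.random.default_rng(0)
states = rng.standard_normal((50, 2))
actions = 0.1 * rng.standard_normal((50, 2))
next_states = states + actions

W = fit_linear_dynamics(states, actions, next_states)
aug_states, aug_actions = corrective_labels(
    W, states, actions, next_states,
    noise_scale=0.05, error_threshold=1e-6, rng=rng)
```

In this toy setting the generated pairs `(aug_states, aug_actions)` would be appended to the demonstrations before behavior cloning, so the policy also sees states slightly off the expert trajectory along with actions that steer back to it.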

Comments:	Presented at IROS 2024
Subjects:	Robotics (cs.RO)
Cite as:	arXiv:2405.19307 [cs.RO]
	(or arXiv:2405.19307v3 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2405.19307

Submission history

From: Abhay Deshpande
[v1] Wed, 29 May 2024 17:31:25 UTC (28,112 KB)
[v2] Mon, 3 Jun 2024 20:42:00 UTC (28,118 KB)
[v3] Mon, 21 Oct 2024 16:44:07 UTC (28,119 KB)
