K-VIL: Keypoints-based Visual Imitation Learning

Gao, Jianfeng; Tao, Zhi; Jaquier, Noémie; Asfour, Tamim

doi:10.1109/TRO.2023.3286074

Computer Science > Robotics

arXiv:2209.03277 (cs)

[Submitted on 7 Sep 2022 (v1), last revised 25 Jul 2023 (this version, v3)]

Title:K-VIL: Keypoints-based Visual Imitation Learning

Authors:Jianfeng Gao, Zhi Tao, Noémie Jaquier, Tamim Asfour

View PDF

Abstract:Visual imitation learning provides efficient and intuitive solutions for robotic systems to acquire novel manipulation skills. However, simultaneously learning geometric task constraints and control policies from visual inputs alone remains a challenging problem. In this paper, we propose an approach for keypoint-based visual imitation (K-VIL) that automatically extracts sparse, object-centric, and embodiment-independent task representations from a small number of human demonstration videos. The task representation is composed of keypoint-based geometric constraints on principal manifolds, their associated local frames, and the movement primitives that are then needed for the task execution. Our approach is capable of extracting such task representations from a single demonstration video, and of incrementally updating them when new demonstrations become available. To reproduce manipulation skills using the learned set of prioritized geometric constraints in novel scenes, we introduce a novel keypoint-based admittance controller. We evaluate our approach in several real-world applications, showcasing its ability to deal with cluttered scenes, viewpoint mismatch, new instances of categorical objects, and large object pose and shape variations, as well as its efficiency and robustness in both one-shot and few-shot imitation learning settings. Videos and source code are available at this https URL.

Subjects:	Robotics (cs.RO)
Cite as:	arXiv:2209.03277 [cs.RO]
	(or arXiv:2209.03277v3 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2209.03277
Journal reference:	IEEE Transactions on Robotics, (2023) 1-21
Related DOI:	https://doi.org/10.1109/TRO.2023.3286074

Submission history

From: Jianfeng Gao [view email]
[v1] Wed, 7 Sep 2022 16:30:06 UTC (18,607 KB)
[v2] Mon, 20 Feb 2023 13:57:13 UTC (21,859 KB)
[v3] Tue, 25 Jul 2023 11:30:33 UTC (18,925 KB)

Computer Science > Robotics

Title:K-VIL: Keypoints-based Visual Imitation Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:K-VIL: Keypoints-based Visual Imitation Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators