Transfering Hierarchical Structure with Dual Meta Imitation Learning

Gao, Chongkai; Jiang, Yizhou; Chen, Feng

Computer Science > Robotics

arXiv:2201.11981 (cs)

[Submitted on 28 Jan 2022 (v1), last revised 19 Feb 2022 (this version, v2)]

Title:Transfering Hierarchical Structure with Dual Meta Imitation Learning

Authors:Chongkai Gao, Yizhou Jiang, Feng Chen

View PDF

Abstract:Hierarchical Imitation Learning (HIL) is an effective way for robots to learn sub-skills from long-horizon unsegmented demonstrations. However, the learned hierarchical structure lacks the mechanism to transfer across multi-tasks or to new tasks, which makes them have to learn from scratch when facing a new situation. Transferring and reorganizing modular sub-skills require fast adaptation ability of the whole hierarchical structure. In this work, we propose Dual Meta Imitation Learning (DMIL), a hierarchical meta imitation learning method where the high-level network and sub-skills are iteratively meta-learned with model-agnostic meta-learning. DMIL uses the likelihood of state-action pairs from each sub-skill as the supervision for the high-level network adaptation, and use the adapted high-level network to determine different data set for each sub-skill adaptation. We theoretically prove the convergence of the iterative training process of DMIL and establish the connection between DMIL and Expectation-Maximization algorithm. Empirically, we achieve state-of-the-art few-shot imitation learning performance on the Meta-world \cite{metaworld} benchmark and competitive results on long-horizon tasks of Kitchen environments.

Subjects:	Robotics (cs.RO); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2201.11981 [cs.RO]
	(or arXiv:2201.11981v2 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2201.11981

Submission history

From: Chongkai Gao [view email]
[v1] Fri, 28 Jan 2022 08:22:38 UTC (8,301 KB)
[v2] Sat, 19 Feb 2022 00:01:31 UTC (8,288 KB)

Computer Science > Robotics

Title:Transfering Hierarchical Structure with Dual Meta Imitation Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:Transfering Hierarchical Structure with Dual Meta Imitation Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators