Adversarial Imitation Learning from Video using a State Observer

Karnan, Haresh; Warnell, Garrett; Torabi, Faraz; Stone, Peter

arXiv:2202.00243v1 (cs)

[Submitted on 1 Feb 2022 (this version), latest version 27 Jul 2022 (v2)]

Abstract: The imitation learning research community has recently made significant progress toward the goal of enabling artificial agents to imitate behaviors from video demonstrations alone. However, current state-of-the-art approaches to this problem exhibit high sample complexity due, in part, to the high-dimensional nature of video observations. To address this issue, we introduce a new algorithm called Visual Generative Adversarial Imitation from Observation using a State Observer (VGAIfO-SO). At its core, VGAIfO-SO seeks to address sample inefficiency using a novel, self-supervised state observer, which provides estimates of lower-dimensional proprioceptive state representations from high-dimensional images. We show experimentally in several continuous control environments that VGAIfO-SO is more sample efficient than other IfO algorithms at learning from video-only demonstrations and can sometimes even achieve performance close to that of the Generative Adversarial Imitation from Observation (GAIfO) algorithm, which has privileged access to the demonstrator's proprioceptive state information.
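
The idea in the abstract, a state observer that maps images to low-dimensional state estimates which then feed an adversarial imitation-from-observation reward, can be illustrated with a toy sketch. This is not the authors' implementation. The linear `render` function, the least-squares observer, the logistic discriminator over state transitions, the reward form, and all dimensions are assumptions made for illustration only. The sketch also assumes the observer is fit on the imitator's own image and state pairs, which the abstract does not spell out.

```python
import numpy as np

rng = np.random.default_rng(0)
STATE_DIM, IMAGE_DIM = 3, 64  # hypothetical sizes, not taken from the paper

# Hypothetical stand-in for a camera: a fixed linear map from state to pixels.
RENDER = rng.normal(size=(STATE_DIM, IMAGE_DIM))


def render(states):
    """Produce flattened 'images' from proprioceptive states (toy assumption)."""
    return states @ RENDER


def fit_state_observer(images, states):
    """Fit a linear observer by least squares on (image, state) pairs."""
    weights, *_ = np.linalg.lstsq(images, states, rcond=None)
    return weights  # shape (IMAGE_DIM, STATE_DIM)


def observe(weights, images):
    """Estimate low-dimensional states from images."""
    return images @ weights


def imitation_reward(disc_params, s, s_next):
    """Assumed reward -log(1 - D(s, s')), with D a logistic discriminator
    giving the probability that a transition came from the demonstrator."""
    logits = np.concatenate([s, s_next], axis=-1) @ disc_params
    return np.logaddexp(0.0, logits)  # equals -log(1 - sigmoid(logits))


# The imitator knows its own states, so it can label its own images.
agent_states = rng.normal(size=(200, STATE_DIM))
observer = fit_state_observer(render(agent_states), agent_states)

# The demonstration is video only; its true states are hidden from the learner.
demo_states = rng.normal(size=(50, STATE_DIM))
demo_frames = render(demo_states)
demo_estimates = observe(observer, demo_frames)

# Score consecutive estimated states with an untrained discriminator.
disc_params = np.zeros(2 * STATE_DIM)
rewards = imitation_reward(disc_params, demo_estimates[:-1], demo_estimates[1:])
```

In this toy setting the discriminator works on pairs of 3-dimensional state estimates rather than pairs of 64-dimensional frames, which is the kind of dimensionality reduction the abstract credits for improved sample efficiency. A real system would replace the linear pieces with learned networks and train the discriminator against the imitator's own transitions.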

Subjects:	Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Systems and Control (eess.SY)
Cite as:	arXiv:2202.00243 [cs.RO]
	(or arXiv:2202.00243v1 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2202.00243
Journal reference:	International Conference on Robotics and Automation (ICRA) 2022

Submission history

From: Haresh Karnan
[v1] Tue, 1 Feb 2022 06:46:48 UTC (1,734 KB)
[v2] Wed, 27 Jul 2022 00:35:11 UTC (1,734 KB)
