Sample-Efficient Imitation Learning via Generative Adversarial Nets

Blondé, Lionel; Kalousis, Alexandros

arXiv:1809.02064 (cs)

[Submitted on 6 Sep 2018 (v1), last revised 8 Mar 2019 (this version, v3)]

Abstract: GAIL (Generative Adversarial Imitation Learning) is a recent, successful imitation learning architecture that exploits the adversarial training procedure introduced in GANs. Although it succeeds at generating behaviours similar to those demonstrated to the agent, GAIL suffers from high sample complexity: it needs a large number of interactions with the environment to achieve satisfactory performance. We dramatically reduce the number of environment interactions needed to learn well-behaved imitation policies, by up to several orders of magnitude. Our framework, operating in the model-free regime, exhibits a significant increase in sample-efficiency over previous methods by simultaneously a) learning a self-tuned adversarially-trained surrogate reward and b) leveraging an off-policy actor-critic architecture. We show that our approach is simple to implement and that the learned agents remain remarkably stable across experiments that span a variety of continuous control tasks. Video visualisations available at: this https URL.

Comments:	Published as a conference paper for AISTATS 2019
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1809.02064 [cs.LG]
	(or arXiv:1809.02064v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1809.02064

Submission history

From: Lionel Blondé
[v1] Thu, 6 Sep 2018 15:55:16 UTC (1,211 KB)
[v2] Tue, 23 Oct 2018 15:31:32 UTC (2,473 KB)
[v3] Fri, 8 Mar 2019 12:00:07 UTC (3,596 KB)
