Robust Imitation Learning from Noisy Demonstrations

Tangkaratt, Voot; Charoenphakdee, Nontawat; Sugiyama, Masashi

arXiv:2010.10181 (stat)

[Submitted on 20 Oct 2020 (v1), last revised 19 Feb 2021 (this version, v3)]

Abstract: Robust learning from noisy demonstrations is a practical but highly challenging problem in imitation learning. In this paper, we first theoretically show that robust imitation learning can be achieved by optimizing a classification risk with a symmetric loss. Based on this theoretical finding, we then propose a new imitation learning method that optimizes the classification risk by effectively combining pseudo-labeling with co-training. Unlike existing methods, our method does not require additional labels or strict assumptions about noise distributions. Experimental results on continuous-control benchmarks show that our method is more robust than state-of-the-art methods.
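
For readers unfamiliar with the term, a margin loss l is commonly called symmetric in the noisy-label literature when l(z) + l(-z) is the same constant for every margin z. The sketch below only illustrates that property under this standard definition; it is not the authors' implementation, and the function names (`sigmoid_loss`, `logistic_loss`, `symmetry_gap`) and the sample margins are hypothetical choices made for the example.

```python
import math


def sigmoid_loss(z: float) -> float:
    """Sigmoid loss l(z) = 1 / (1 + exp(z)); small when the margin z is large."""
    return 1.0 / (1.0 + math.exp(z))


def logistic_loss(z: float) -> float:
    """Logistic loss l(z) = log(1 + exp(-z)); included for contrast."""
    return math.log1p(math.exp(-z))


def symmetry_gap(loss, margins):
    """Spread (max - min) of l(z) + l(-z) over the given margins.

    A spread of (numerically) zero means the loss behaves as a
    symmetric loss on these margins.
    """
    sums = [loss(z) + loss(-z) for z in margins]
    return max(sums) - min(sums)


# Arbitrary sample margins, chosen only for illustration.
margins = [-3.0, -1.0, 0.0, 0.5, 2.0]

sigmoid_gap = symmetry_gap(sigmoid_loss, margins)    # l(z) + l(-z) = 1 for all z
logistic_gap = symmetry_gap(logistic_loss, margins)  # the sum varies with z

print(f"sigmoid gap:  {sigmoid_gap:.2e}")
print(f"logistic gap: {logistic_gap:.2e}")
```

The sigmoid loss satisfies the condition up to floating-point error, while the logistic loss does not, which is why the two behave differently in analyses of learning from noisy labels.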

Comments:	16 pages, 9 figures. Accepted to AISTATS 2021
Subjects:	Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2010.10181 [stat.ML]
	(or arXiv:2010.10181v3 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.2010.10181

Submission history

From: Voot Tangkaratt
[v1] Tue, 20 Oct 2020 10:41:37 UTC (6,125 KB)
[v2] Sat, 31 Oct 2020 05:34:29 UTC (6,125 KB)
[v3] Fri, 19 Feb 2021 13:38:24 UTC (12,089 KB)
