Learning from Imperfect Demonstrations from Agents with Varying Dynamics

Cao, Zhangjie; Sadigh, Dorsa

Computer Science > Machine Learning

arXiv:2103.05910 (cs)

[Submitted on 10 Mar 2021]

Title:Learning from Imperfect Demonstrations from Agents with Varying Dynamics

Authors:Zhangjie Cao, Dorsa Sadigh

View PDF

Abstract:Imitation learning enables robots to learn from demonstrations. Previous imitation learning algorithms usually assume access to optimal expert demonstrations. However, in many real-world applications, this assumption is limiting. Most collected demonstrations are not optimal or are produced by an agent with slightly different dynamics. We therefore address the problem of imitation learning when the demonstrations can be sub-optimal or be drawn from agents with varying dynamics. We develop a metric composed of a feasibility score and an optimality score to measure how useful a demonstration is for imitation learning. The proposed score enables learning from more informative demonstrations, and disregarding the less relevant demonstrations. Our experiments on four environments in simulation and on a real robot show improved learned policies with higher expected return.

Comments:	Accpeted by ICRA 2021
Subjects:	Machine Learning (cs.LG); Robotics (cs.RO)
Cite as:	arXiv:2103.05910 [cs.LG]
	(or arXiv:2103.05910v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2103.05910

Submission history

From: Zhangjie Cao [view email]
[v1] Wed, 10 Mar 2021 07:39:38 UTC (8,110 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2021-03

Change to browse by:

cs
cs.RO

References & Citations

DBLP - CS Bibliography

listing | bibtex

Zhangjie Cao
Dorsa Sadigh

export BibTeX citation

Computer Science > Machine Learning

Title:Learning from Imperfect Demonstrations from Agents with Varying Dynamics

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Learning from Imperfect Demonstrations from Agents with Varying Dynamics

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators