Directed-Info GAIL: Learning Hierarchical Policies from Unsegmented Demonstrations using Directed Information

Sharma, Arjun; Sharma, Mohit; Rhinehart, Nicholas; Kitani, Kris M.

Computer Science > Machine Learning

arXiv:1810.01266 (cs)

[Submitted on 29 Sep 2018 (v1), last revised 12 Mar 2019 (this version, v2)]

Title:Directed-Info GAIL: Learning Hierarchical Policies from Unsegmented Demonstrations using Directed Information

Authors:Arjun Sharma, Mohit Sharma, Nicholas Rhinehart, Kris M. Kitani

View PDF

Abstract:The use of imitation learning to learn a single policy for a complex task that has multiple modes or hierarchical structure can be challenging. In fact, previous work has shown that when the modes are known, learning separate policies for each mode or sub-task can greatly improve the performance of imitation learning. In this work, we discover the interaction between sub-tasks from their resulting state-action trajectory sequences using a directed graphical model. We propose a new algorithm based on the generative adversarial imitation learning framework which automatically learns sub-task policies from unsegmented demonstrations. Our approach maximizes the directed information flow in the graphical model between sub-task latent variables and their generated trajectories. We also show how our approach connects with the existing Options framework, which is commonly used to learn hierarchical policies.

Comments:	Accepted as conference paper at ICLR'19
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as:	arXiv:1810.01266 [cs.LG]
	(or arXiv:1810.01266v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1810.01266

Submission history

From: Mohit Sharma [view email]
[v1] Sat, 29 Sep 2018 18:40:13 UTC (1,995 KB)
[v2] Tue, 12 Mar 2019 02:06:19 UTC (2,212 KB)

Computer Science > Machine Learning

Title:Directed-Info GAIL: Learning Hierarchical Policies from Unsegmented Demonstrations using Directed Information

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Directed-Info GAIL: Learning Hierarchical Policies from Unsegmented Demonstrations using Directed Information

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators