Inverse Reinforcement Learning via Deep Gaussian Process

Jin, Ming; Damianou, Andreas; Abbeel, Pieter; Spanos, Costas

Computer Science > Machine Learning

arXiv:1512.08065v3 (cs)

[Submitted on 26 Dec 2015 (v1), revised 2 May 2017 (this version, v3), latest version 4 May 2017 (v4)]

Title:Inverse Reinforcement Learning via Deep Gaussian Process

Authors:Ming Jin, Andreas Damianou, Pieter Abbeel, Costas Spanos

View PDF

Abstract:We propose a new approach to inverse reinforcement learning (IRL) based on the deep Gaussian process (deep GP) model, which is capable of learning complicated reward structures with few demonstrations. Our model stacks multiple latent GP layers to learn abstract representations of the state feature space, which is linked to the demonstrations through the Maximum Entropy learning framework. Incorporating the IRL engine into the nonlinear latent structure renders existing deep GP inference approaches intractable. To tackle this, we develop a non-standard variational approximation framework which extends previous inference schemes. This allows for approximate Bayesian treatment of the feature space and guards against overfitting. Carrying out representation and inverse reinforcement learning simultaneously within our model outperforms state-of-the-art approaches, as we demonstrate with experiments on standard benchmarks ("object world","highway driving") and a new benchmark ("binary world").

Subjects:	Machine Learning (cs.LG); Robotics (cs.RO); Machine Learning (stat.ML)
Cite as:	arXiv:1512.08065 [cs.LG]
	(or arXiv:1512.08065v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1512.08065

Submission history

From: Ming Jin [view email]
[v1] Sat, 26 Dec 2015 01:40:37 UTC (431 KB)
[v2] Thu, 30 Mar 2017 03:36:37 UTC (1,329 KB)
[v3] Tue, 2 May 2017 03:11:45 UTC (1,329 KB)
[v4] Thu, 4 May 2017 23:20:24 UTC (1,329 KB)

Computer Science > Machine Learning

Title:Inverse Reinforcement Learning via Deep Gaussian Process

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Inverse Reinforcement Learning via Deep Gaussian Process

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators