Inverse Reinforcement Learning via Deep Gaussian Process

Jin, Ming; Spanos, Costas

arXiv:1512.08065v1 (cs)

[Submitted on 26 Dec 2015 (this version), latest version 4 May 2017 (v4)]

Abstract: This report proposes an approach to inverse reinforcement learning (IRL) based on the deep Gaussian process (GP), which can learn complicated reward structures from few demonstrations. The model stacks multiple latent GP layers to learn abstract representations of the state feature space, and these representations are linked to the demonstrations through the Maximum Entropy learning framework. Because the nonlinearity of the latent variables makes analytic derivation of the model evidence prohibitive, variational inference with a special choice of variational distributions is used for approximate inference. This choice guards the model against overfitting, acting as an automatic Occam's razor. Experiments on the object world benchmark and on a new setup, binary world, show the proposed method outperforming state-of-the-art approaches.
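The two ingredients named in the abstract, stacked GP layers over state features and a Maximum Entropy link to demonstrations, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: it only draws one reward sample from a two-layer GP prior and scores demonstrations under a MaxEnt (soft value iteration) policy, and it omits the variational inference entirely. The function names, the RBF kernel, the latent dimension, and the discount factor are all assumptions made for illustration.

```python
import numpy as np

def rbf_kernel(x, lengthscale=1.0):
    # Squared-exponential kernel matrix between the rows of x (assumed kernel).
    sq = ((x[:, None, :] - x[None, :, :]) ** 2).sum(-1)
    return np.exp(-0.5 * sq / lengthscale ** 2)

def sample_deep_gp_reward(features, latent_dim=2, rng=None, jitter=1e-6):
    # Hypothetical helper: one prior draw from two stacked GP layers.
    # Layer 1 maps state features to a latent representation;
    # layer 2 maps that representation to one reward value per state.
    rng = np.random.default_rng(0) if rng is None else rng
    n = features.shape[0]
    chol1 = np.linalg.cholesky(rbf_kernel(features) + jitter * np.eye(n))
    latent = chol1 @ rng.standard_normal((n, latent_dim))
    chol2 = np.linalg.cholesky(rbf_kernel(latent) + jitter * np.eye(n))
    return chol2 @ rng.standard_normal(n)

def soft_policy(reward, transitions, gamma=0.9, iters=200):
    # MaxEnt (soft) value iteration.
    # reward: (S,) state rewards; transitions: (S, A, S) probabilities.
    v = np.zeros(transitions.shape[0])
    for _ in range(iters):
        q = reward[:, None] + gamma * (transitions @ v)   # shape (S, A)
        q_max = q.max(axis=1)
        # Soft maximum over actions via a stable log-sum-exp.
        v = q_max + np.log(np.exp(q - q_max[:, None]).sum(axis=1))
    q = reward[:, None] + gamma * (transitions @ v)
    q -= q.max(axis=1, keepdims=True)
    policy = np.exp(q)
    return policy / policy.sum(axis=1, keepdims=True)     # rows sum to 1

def demo_log_likelihood(policy, demos):
    # Log-likelihood of demonstrated (state, action) pairs under the policy.
    return float(sum(np.log(policy[s, a]) for s, a in demos))
```

In an IRL loop, a candidate reward (here, a prior sample) would be scored with `demo_log_likelihood(soft_policy(reward, transitions), demos)`; the report's method replaces the prior draw with approximate posterior inference over the latent layers.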

Subjects: Machine Learning (cs.LG); Robotics (cs.RO); Machine Learning (stat.ML)
Cite as: arXiv:1512.08065 [cs.LG]
(or arXiv:1512.08065v1 [cs.LG] for this version)
https://doi.org/10.48550/arXiv.1512.08065

Submission history

From: Ming Jin
[v1] Sat, 26 Dec 2015 01:40:37 UTC (431 KB)
[v2] Thu, 30 Mar 2017 03:36:37 UTC (1,329 KB)
[v3] Tue, 2 May 2017 03:11:45 UTC (1,329 KB)
[v4] Thu, 4 May 2017 23:20:24 UTC (1,329 KB)
