Robust Policy Search for Robot Navigation with Stochastic Meta-Policies

Garcia-Barcos, Javier; Martinez-Cantin, Ruben

Computer Science > Machine Learning

arXiv:2003.01000 (cs)

[Submitted on 2 Mar 2020]

Title:Robust Policy Search for Robot Navigation with Stochastic Meta-Policies

Authors:Javier Garcia-Barcos, Ruben Martinez-Cantin

View PDF

Abstract:Bayesian optimization is an efficient nonlinear optimization method where the queries are carefully selected to gather information about the optimum location. Thus, in the context of policy search, it has been called active policy search. The main ingredients of Bayesian optimization for sample efficiency are the probabilistic surrogate model and the optimal decision heuristics. In this work, we exploit those to provide robustness to different issues for policy search algorithms. We combine several methods and show how their interaction works better than the sum of the parts. First, to deal with input noise and provide a safe and repeatable policy we use an improved version of unscented Bayesian optimization. Then, to deal with mismodeling errors and improve exploration we use stochastic meta-policies for query selection and an adaptive kernel. We compare the proposed algorithm with previous results in several optimization benchmarks and robot tasks, such as pushing objects with a robot arm, or path finding with a rover.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2003.01000 [cs.LG]
	(or arXiv:2003.01000v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2003.01000

Submission history

From: Javier Garcia-Barcos [view email]
[v1] Mon, 2 Mar 2020 16:30:59 UTC (7,097 KB)

Full-text links:

Access Paper:

view license

Current browse context:

stat.ML

< prev | next >

new | recent | 2020-03

Change to browse by:

cs
cs.LG
stat

References & Citations

DBLP - CS Bibliography

listing | bibtex

Javier Garcia-Barcos
Ruben Martinez-Cantin

export BibTeX citation

Computer Science > Machine Learning

Title:Robust Policy Search for Robot Navigation with Stochastic Meta-Policies

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Robust Policy Search for Robot Navigation with Stochastic Meta-Policies

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators