Inferring Smooth Control: Monte Carlo Posterior Policy Iteration with Gaussian Processes

Watson, Joe; Peters, Jan

arXiv:2210.03512 (cs)

[Submitted on 7 Oct 2022]

Abstract: Monte Carlo methods have become increasingly relevant for control of non-differentiable systems, approximate dynamics models and learning from data. These methods scale to high-dimensional spaces and are effective at the non-convex optimizations often seen in robot learning. We look at sample-based methods from the perspective of inference-based control, specifically posterior policy iteration. From this perspective, we highlight how Gaussian noise priors produce rough control actions that are unsuitable for physical robot deployment. Considering smoother Gaussian process priors, as used in episodic reinforcement learning and motion planning, we demonstrate how smoother model predictive control can be achieved using online sequential inference. This inference is realized through an efficient factorization of the action distribution and a novel means of optimizing the likelihood temperature to improve importance sampling accuracy. We evaluate this approach on several high-dimensional robot control tasks, matching the sample efficiency of prior heuristic methods while also ensuring smoothness. Simulation results can be seen at this https URL.
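
The contrast the abstract draws between Gaussian noise priors and Gaussian process priors can be illustrated with a small sketch. This is not the authors' implementation: the squared exponential kernel, the lengthscale, the jitter value, the roughness measure, and all function names below are assumptions chosen only for illustration. The sketch draws action sequences from an independent (white noise) prior and from a GP prior over time, then compares the mean squared difference between consecutive actions.

```python
import numpy as np


def squared_exponential_kernel(t, lengthscale, variance=1.0):
    """Covariance between time steps; an assumed kernel, not taken from the paper."""
    diff = t[:, None] - t[None, :]
    return variance * np.exp(-0.5 * (diff / lengthscale) ** 2)


def sample_action_sequences(cov, n_samples, rng):
    """Draw zero-mean Gaussian action sequences with the given covariance over time."""
    # A small diagonal jitter keeps the Cholesky factorization numerically stable.
    jitter = 1e-6 * np.eye(cov.shape[0])
    chol = np.linalg.cholesky(cov + jitter)
    return rng.standard_normal((n_samples, cov.shape[0])) @ chol.T


def roughness(actions):
    """Mean squared difference between consecutive actions (lower is smoother)."""
    return float(np.mean(np.diff(actions, axis=1) ** 2))


rng = np.random.default_rng(0)
horizon = 50
t = np.arange(horizon, dtype=float)

# Independent Gaussian noise at every time step.
white = sample_action_sequences(np.eye(horizon), 256, rng)
# Temporally correlated actions from a GP prior with an assumed lengthscale of 5 steps.
smooth = sample_action_sequences(
    squared_exponential_kernel(t, lengthscale=5.0), 256, rng
)

rough_white = roughness(white)
rough_smooth = roughness(smooth)
print(f"white noise roughness: {rough_white:.3f}")
print(f"GP prior roughness:    {rough_smooth:.3f}")
```

Under these assumptions both priors have unit marginal variance at each time step, so the difference in roughness comes only from the temporal correlation that the kernel introduces.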

Comments:	43 pages, 37 figures. Conference on Robot Learning 2022
Subjects:	Machine Learning (cs.LG); Robotics (cs.RO)
Cite as:	arXiv:2210.03512 [cs.LG]
	(or arXiv:2210.03512v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2210.03512

Submission history

From: Joe Watson
[v1] Fri, 7 Oct 2022 12:56:31 UTC (4,180 KB)
