Simulation-Aided Policy Tuning for Black-Box Robot Learning

He, Shiming; von Rohr, Alexander; Baumann, Dominik; Xiang, Ji; Trimpe, Sebastian

arXiv:2411.14246 (cs)

[Submitted on 21 Nov 2024]

Abstract: How can robots learn and adapt to new tasks and situations with little data? Systematic exploration and simulation are crucial tools for efficient robot learning. We present a novel black-box policy search algorithm focused on data-efficient policy improvements. The algorithm learns directly on the robot and treats simulation as an additional information source to speed up the learning process. At the core of the algorithm, a probabilistic model learns how the robot learning objective depends on the policy parameters, not only by performing experiments on the robot but also by leveraging data from a simulator. This substantially reduces interaction time with the robot. Using this model, we can guarantee improvements with high probability for each policy update, thereby facilitating fast, goal-oriented learning. We evaluate our algorithm on simulated fine-tuning tasks and demonstrate the data efficiency of the proposed dual-information-source optimization algorithm. In a real robot learning experiment, we show fast and successful task learning on a robot manipulator with the aid of an imperfect simulator.
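
The idea of a probabilistic model that combines simulator and robot data can be illustrated with a small sketch. It is not the paper's algorithm: the abstract does not specify the model, so the sketch assumes a Gaussian process whose kernel adds a shared term (seen by both sources) and a robot-only error term, a one-dimensional policy parameter, a cost that is minimized, and hand-picked hyperparameters. All function names, the toy cost functions, and the numeric values are hypothetical.

```python
import math
import numpy as np

SIM, ROBOT = 0, 1  # hypothetical labels for the two information sources


def rbf(a, b, lengthscale, variance):
    """Squared-exponential kernel between two 1-D arrays of policy parameters."""
    diff = a[:, None] - b[None, :]
    return variance * np.exp(-0.5 * (diff / lengthscale) ** 2)


def two_source_kernel(xa, sa, xb, sb):
    """Shared term for all pairs plus an error term for robot-robot pairs only.

    Assumption: robot objective = simulator objective + smooth error function.
    """
    shared = rbf(xa, xb, lengthscale=0.5, variance=1.0)
    error = rbf(xa, xb, lengthscale=0.5, variance=0.1)
    both_robot = (sa[:, None] == ROBOT) & (sb[None, :] == ROBOT)
    return shared + both_robot * error


def robot_posterior(x_train, s_train, y_train, x_query, noise=1e-4):
    """Posterior mean and covariance of the robot objective at x_query."""
    s_query = np.full(len(x_query), ROBOT)
    K = two_source_kernel(x_train, s_train, x_train, s_train)
    K += noise * np.eye(len(x_train))
    Ks = two_source_kernel(x_query, s_query, x_train, s_train)
    Kss = two_source_kernel(x_query, s_query, x_query, s_query)
    mean = Ks @ np.linalg.solve(K, y_train)
    cov = Kss - Ks @ np.linalg.solve(K, Ks.T)
    return mean, cov


def improvement_probability(x_train, s_train, y_train, current, candidate):
    """P(cost(candidate) < cost(current)) under the joint Gaussian posterior."""
    query = np.array([current, candidate])
    mean, cov = robot_posterior(x_train, s_train, y_train, query)
    diff_mean = mean[1] - mean[0]
    diff_var = cov[0, 0] + cov[1, 1] - 2.0 * cov[0, 1]
    diff_std = math.sqrt(max(diff_var, 1e-12))  # guard against round-off
    return 0.5 * (1.0 + math.erf(-diff_mean / (diff_std * math.sqrt(2.0))))


# Toy problem (hypothetical): the simulator's optimum is slightly misplaced.
def sim_cost(theta):
    return (theta - 0.3) ** 2


def robot_cost(theta):
    return (theta - 0.5) ** 2


x_sim = np.linspace(-1.0, 1.5, 12)   # many cheap simulator evaluations
x_rob = np.array([-0.5, 0.0])        # only two robot experiments
x_train = np.concatenate([x_sim, x_rob])
s_train = np.concatenate([np.full(len(x_sim), SIM), np.full(len(x_rob), ROBOT)])
y_train = np.concatenate([sim_cost(x_sim), robot_cost(x_rob)])

# Probability that moving from theta = 0.0 lowers the robot cost.
p_toward = improvement_probability(x_train, s_train, y_train, 0.0, 0.3)
p_away = improvement_probability(x_train, s_train, y_train, 0.0, -0.4)
print(f"toward optimum: {p_toward:.3f}, away from optimum: {p_away:.3f}")
```

In a tuner built on such a model, a candidate policy update would be accepted only if its improvement probability exceeds a chosen threshold; the threshold and the rule for proposing candidates are design choices that this sketch leaves open.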

Subjects:	Robotics (cs.RO); Machine Learning (cs.LG); Systems and Control (eess.SY)
Cite as:	arXiv:2411.14246 [cs.RO]
	(or arXiv:2411.14246v1 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2411.14246

Submission history

From: Alexander von Rohr
[v1] Thu, 21 Nov 2024 15:52:23 UTC (3,819 KB)
