Bingham Policy Parameterization for 3D Rotations in Reinforcement Learning

James, Stephen; Abbeel, Pieter

Computer Science > Robotics

arXiv:2202.03957 (cs)

[Submitted on 8 Feb 2022]

Title:Bingham Policy Parameterization for 3D Rotations in Reinforcement Learning

Authors:Stephen James, Pieter Abbeel

View PDF

Abstract:We propose a new policy parameterization for representing 3D rotations during reinforcement learning. Today in the continuous control reinforcement learning literature, many stochastic policy parameterizations are Gaussian. We argue that universally applying a Gaussian policy parameterization is not always desirable for all environments. One such case in particular where this is true are tasks that involve predicting a 3D rotation output, either in isolation, or coupled with translation as part of a full 6D pose output. Our proposed Bingham Policy Parameterization (BPP) models the Bingham distribution and allows for better rotation (quaternion) prediction over a Gaussian policy parameterization in a range of reinforcement learning tasks. We evaluate BPP on the rotation Wahba problem task, as well as a set of vision-based next-best pose robot manipulation tasks from RLBench. We hope that this paper encourages more research into developing other policy parameterization that are more suited for particular environments, rather than always assuming Gaussian.

Comments:	Project page and code: this https URL
Subjects:	Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2202.03957 [cs.RO]
	(or arXiv:2202.03957v1 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2202.03957

Submission history

From: Stephen James [view email]
[v1] Tue, 8 Feb 2022 16:09:02 UTC (7,018 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2022-02

Change to browse by:

cs
cs.AI
cs.CV
cs.RO

References & Citations

DBLP - CS Bibliography

listing | bibtex

Stephen James
Pieter Abbeel

export BibTeX citation

Computer Science > Robotics

Title:Bingham Policy Parameterization for 3D Rotations in Reinforcement Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:Bingham Policy Parameterization for 3D Rotations in Reinforcement Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators