PointPatchRL -- Masked Reconstruction Improves Reinforcement Learning on Point Clouds

Gyenes, Balázs; Franke, Nikolai; Becker, Philipp; Neumann, Gerhard

Computer Science > Machine Learning

arXiv:2410.18800 (cs)

[Submitted on 24 Oct 2024]

Title:PointPatchRL -- Masked Reconstruction Improves Reinforcement Learning on Point Clouds

Authors:Balázs Gyenes, Nikolai Franke, Philipp Becker, Gerhard Neumann

View PDF HTML (experimental)

Abstract:Perceiving the environment via cameras is crucial for Reinforcement Learning (RL) in robotics. While images are a convenient form of representation, they often complicate extracting important geometric details, especially with varying geometries or deformable objects. In contrast, point clouds naturally represent this geometry and easily integrate color and positional data from multiple camera views. However, while deep learning on point clouds has seen many recent successes, RL on point clouds is under-researched, with only the simplest encoder architecture considered in the literature. We introduce PointPatchRL (PPRL), a method for RL on point clouds that builds on the common paradigm of dividing point clouds into overlapping patches, tokenizing them, and processing the tokens with transformers. PPRL provides significant improvements compared with other point-cloud processing architectures previously used for RL. We then complement PPRL with masked reconstruction for representation learning and show that our method outperforms strong model-free and model-based baselines on image observations in complex manipulation tasks containing deformable objects and variations in target object geometry. Videos and code are available at this https URL

Comments:	18 pages, 15 figures, accepted for publication at the 8th Conference on Robot Learning (CoRL 2024)
Subjects:	Machine Learning (cs.LG); Robotics (cs.RO)
Cite as:	arXiv:2410.18800 [cs.LG]
	(or arXiv:2410.18800v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2410.18800

Submission history

From: Balázs Gyenes [view email]
[v1] Thu, 24 Oct 2024 14:51:09 UTC (3,072 KB)

Computer Science > Machine Learning

Title:PointPatchRL -- Masked Reconstruction Improves Reinforcement Learning on Point Clouds

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:PointPatchRL -- Masked Reconstruction Improves Reinforcement Learning on Point Clouds

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators