A Simple Approach for Visual Rearrangement: 3D Mapping and Semantic Search

Trabucco, Brandon; Sigurdsson, Gunnar; Piramuthu, Robinson; Sukhatme, Gaurav S.; Salakhutdinov, Ruslan

arXiv:2206.13396 (cs)

[Submitted on 21 Jun 2022 (v1), last revised 9 Aug 2022 (this version, v2)]

Abstract: Physically rearranging objects is an important capability for embodied agents. Visual room rearrangement evaluates an agent's ability to rearrange objects in a room to a desired goal based solely on visual input. We propose a simple yet effective method for this problem: (1) search for and map which objects need to be rearranged, and (2) rearrange each object until the task is complete. Our approach consists of an off-the-shelf semantic segmentation model, voxel-based semantic map, and semantic search policy to efficiently find objects that need to be rearranged. On the AI2-THOR Rearrangement Challenge, our method improves on current state-of-the-art end-to-end reinforcement learning-based methods that learn visual rearrangement policies from 0.53% correct rearrangement to 16.56%, using only 2.7% as many samples from the environment.
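
The idea of mapping which objects need to be rearranged can be illustrated with a minimal sketch. It is not the authors' implementation: the class `SemanticVoxelMap`, the helpers `voxelize` and `objects_to_rearrange`, the 5 cm voxel size, and the majority-vote labeling are all hypothetical choices made for illustration. It assumes that labeled 3D points (for example, segmentation labels projected into world coordinates) are already available, builds one voxel map for the goal state and one for the current state, and reports the object classes whose occupied voxels differ.

```python
import math
from collections import Counter, defaultdict
from typing import Dict, Iterable, List, Set, Tuple

Voxel = Tuple[int, int, int]
Point = Tuple[float, float, float]

VOXEL_SIZE = 0.05  # metres; an assumed resolution, not taken from the paper


def voxelize(point: Point, voxel_size: float = VOXEL_SIZE) -> Voxel:
    """Map a 3D point to the integer index of the voxel that contains it."""
    x, y, z = point
    return (
        math.floor(x / voxel_size),
        math.floor(y / voxel_size),
        math.floor(z / voxel_size),
    )


class SemanticVoxelMap:
    """Hypothetical voxel map that stores per-voxel votes for class labels."""

    def __init__(self, voxel_size: float = VOXEL_SIZE) -> None:
        self.voxel_size = voxel_size
        self.votes: Dict[Voxel, Counter] = defaultdict(Counter)

    def add_observation(self, labeled_points: Iterable[Tuple[Point, str]]) -> None:
        """Accumulate one vote per labeled point in the voxel it falls into."""
        for point, label in labeled_points:
            self.votes[voxelize(point, self.voxel_size)][label] += 1

    def voxels_by_label(self) -> Dict[str, Set[Voxel]]:
        """Assign each voxel its most-voted label and group voxels by label."""
        grouped: Dict[str, Set[Voxel]] = defaultdict(set)
        for voxel, counter in self.votes.items():
            label = counter.most_common(1)[0][0]
            grouped[label].add(voxel)
        return dict(grouped)


def objects_to_rearrange(goal: SemanticVoxelMap, current: SemanticVoxelMap) -> List[str]:
    """Return labels whose occupied voxels differ between the two maps."""
    goal_voxels = goal.voxels_by_label()
    current_voxels = current.voxels_by_label()
    labels = goal_voxels.keys() | current_voxels.keys()
    return sorted(
        label
        for label in labels
        if goal_voxels.get(label, set()) != current_voxels.get(label, set())
    )


if __name__ == "__main__":
    goal_map = SemanticVoxelMap()
    goal_map.add_observation([((0.52, 0.91, 0.33), "mug"), ((1.02, 0.51, 0.22), "book")])

    current_map = SemanticVoxelMap()
    current_map.add_observation([((1.52, 0.91, 0.33), "mug"), ((1.02, 0.51, 0.22), "book")])

    # The mug occupies a different voxel in the two maps; the book does not.
    print(objects_to_rearrange(goal_map, current_map))
```

Comparing exact voxel sets is the simplest possible disagreement test. A practical system would likely need some tolerance for segmentation noise and partial observations, which this sketch omits.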

Comments:	Winner of the Rearrangement Challenge at CVPR 2022
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
Cite as:	arXiv:2206.13396 [cs.CV]
	(or arXiv:2206.13396v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2206.13396

Submission history

From: Brandon Trabucco
[v1] Tue, 21 Jun 2022 02:33:57 UTC (6,068 KB)
[v2] Tue, 9 Aug 2022 20:47:35 UTC (6,069 KB)
