See What the Robot Can't See: Learning Cooperative Perception for Visual Navigation

Blumenkamp, Jan; Li, Qingbiao; Wang, Binyu; Liu, Zhe; Prorok, Amanda

arXiv:2208.00759 (cs)

[Submitted on 1 Aug 2022 (v1), last revised 31 Jul 2023 (this version, v5)]

Abstract: We consider the problem of navigating a mobile robot towards a target in an unknown environment that is endowed with visual sensors, where neither the robot nor the sensors have access to global positioning information and only use first-person-view images. In order to overcome the need for positioning, we train the sensors to encode and communicate relevant viewpoint information to the mobile robot, whose objective is to use this information to navigate to the target along the shortest path. We overcome the challenge of enabling all the sensors (even those that cannot directly see the target) to predict the direction along the shortest path to the target by implementing a neighborhood-based feature aggregation module using a Graph Neural Network (GNN) architecture. In our experiments, we first demonstrate generalizability to previously unseen environments with various sensor layouts. Our results show that by using communication between the sensors and the robot, we achieve up to 2.0x improvement in SPL (Success weighted by Path Length) when compared to a communication-free baseline. This is done without requiring a global map, positioning data, or pre-calibration of the sensor network. Second, we perform a zero-shot transfer of our model from simulation to the real world. Laboratory experiments demonstrate the feasibility of our approach in various cluttered environments. Finally, we showcase examples of successful navigation to the target while both the sensor network layout and the obstacles are dynamically reconfigured as the robot navigates. We provide a video demo, the dataset, trained models, and source code.
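
For readers unfamiliar with the SPL metric named in the abstract, the following is a minimal sketch of how Success weighted by Path Length is commonly computed: each successful episode contributes the ratio of the shortest-path length to the longer of the shortest-path length and the path actually travelled, failed episodes contribute zero, and the result is averaged over all episodes. The function name `spl`, its parameter names, and the example numbers are illustrative assumptions and are not taken from the paper or its released code.

```python
from typing import Sequence


def spl(successes: Sequence[bool],
        shortest_lengths: Sequence[float],
        path_lengths: Sequence[float]) -> float:
    """Average Success weighted by Path Length over a set of episodes.

    All names here are hypothetical; this is an illustration of the
    commonly used definition, not the authors' evaluation code.
    """
    n = len(successes)
    if n == 0 or n != len(shortest_lengths) or n != len(path_lengths):
        raise ValueError("inputs must be non-empty and of equal length")

    total = 0.0
    for success, shortest, travelled in zip(successes, shortest_lengths, path_lengths):
        if shortest <= 0:
            raise ValueError("shortest-path lengths must be positive")
        if success:
            # A path no longer than the shortest path scores 1.0;
            # longer paths are penalised proportionally.
            total += shortest / max(travelled, shortest)
        # Failed episodes add nothing to the sum.
    return total / n


# Made-up example: one optimal success, one success with a path twice
# as long as necessary, and one failure.
example_score = spl([True, True, False], [10.0, 10.0, 10.0], [10.0, 20.0, 15.0])
print(example_score)
```

Under this definition, a reported "2.0x improvement in SPL" would mean that the ratio between the SPL of the communicating model and the SPL of the communication-free baseline is 2.0.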

Comments:	Accepted to be presented at IROS 2023
Subjects:	Robotics (cs.RO); Machine Learning (cs.LG); Multiagent Systems (cs.MA); Systems and Control (eess.SY)
Cite as:	arXiv:2208.00759 [cs.RO]
	(or arXiv:2208.00759v5 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2208.00759

Submission history

From: Jan Blumenkamp
[v1] Mon, 1 Aug 2022 11:37:01 UTC (25,143 KB)
[v2] Fri, 5 Aug 2022 09:50:00 UTC (12,568 KB)
[v3] Tue, 30 Aug 2022 15:10:58 UTC (20,710 KB)
[v4] Tue, 18 Apr 2023 16:28:41 UTC (12,058 KB)
[v5] Mon, 31 Jul 2023 16:40:56 UTC (5,944 KB)
