Sim-to-Real Transfer for Vision-and-Language Navigation

Anderson, Peter; Shrivastava, Ayush; Truong, Joanne; Majumdar, Arjun; Parikh, Devi; Batra, Dhruv; Lee, Stefan

Computer Science > Computer Vision and Pattern Recognition

arXiv:2011.03807 (cs)

[Submitted on 7 Nov 2020]

Title:Sim-to-Real Transfer for Vision-and-Language Navigation

Authors:Peter Anderson, Ayush Shrivastava, Joanne Truong, Arjun Majumdar, Devi Parikh, Dhruv Batra, Stefan Lee

View PDF

Abstract:We study the challenging problem of releasing a robot in a previously unseen environment, and having it follow unconstrained natural language navigation instructions. Recent work on the task of Vision-and-Language Navigation (VLN) has achieved significant progress in simulation. To assess the implications of this work for robotics, we transfer a VLN agent trained in simulation to a physical robot. To bridge the gap between the high-level discrete action space learned by the VLN agent, and the robot's low-level continuous action space, we propose a subgoal model to identify nearby waypoints, and use domain randomization to mitigate visual domain differences. For accurate sim and real comparisons in parallel environments, we annotate a 325m2 office space with 1.3km of navigation instructions, and create a digitized replica in simulation. We find that sim-to-real transfer to an environment not seen in training is successful if an occupancy map and navigation graph can be collected and annotated in advance (success rate of 46.8% vs. 55.9% in sim), but much more challenging in the hardest setting with no prior mapping at all (success rate of 22.5%).

Comments:	CoRL 2020
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Robotics (cs.RO)
Cite as:	arXiv:2011.03807 [cs.CV]
	(or arXiv:2011.03807v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2011.03807

Submission history

From: Peter Anderson [view email]
[v1] Sat, 7 Nov 2020 16:49:04 UTC (18,232 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Sim-to-Real Transfer for Vision-and-Language Navigation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Sim-to-Real Transfer for Vision-and-Language Navigation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators