Mobile Robot Navigation Using Hand-Drawn Maps: A Vision Language Model Approach

Tan, Aaron Hao; Fung, Angus; Wang, Haitong; Nejat, Goldie

Computer Science > Robotics

arXiv:2502.00114 (cs)

[Submitted on 31 Jan 2025]

Title:Mobile Robot Navigation Using Hand-Drawn Maps: A Vision Language Model Approach

Authors:Aaron Hao Tan, Angus Fung, Haitong Wang, Goldie Nejat

View PDF

Abstract:Hand-drawn maps can be used to convey navigation instructions between humans and robots in a natural and efficient manner. However, these maps can often contain inaccuracies such as scale distortions and missing landmarks which present challenges for mobile robot navigation. This paper introduces a novel Hand-drawn Map Navigation (HAM-Nav) architecture that leverages pre-trained vision language models (VLMs) for robot navigation across diverse environments, hand-drawing styles, and robot embodiments, even in the presence of map inaccuracies. HAM-Nav integrates a unique Selective Visual Association Prompting approach for topological map-based position estimation and navigation planning as well as a Predictive Navigation Plan Parser to infer missing landmarks. Extensive experiments were conducted in photorealistic simulated environments, using both wheeled and legged robots, demonstrating the effectiveness of HAM-Nav in terms of navigation success rates and Success weighted by Path Length. Furthermore, a user study in real-world environments highlighted the practical utility of hand-drawn maps for robot navigation as well as successful navigation outcomes.

Comments:	8 pages, 8 figures
Subjects:	Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2502.00114 [cs.RO]
	(or arXiv:2502.00114v1 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2502.00114

Submission history

From: Aaron (Hao) Tan [view email]
[v1] Fri, 31 Jan 2025 19:03:33 UTC (5,956 KB)

Computer Science > Robotics

Title:Mobile Robot Navigation Using Hand-Drawn Maps: A Vision Language Model Approach

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:Mobile Robot Navigation Using Hand-Drawn Maps: A Vision Language Model Approach

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators