Generating Contextually-Relevant Navigation Instructions for Blind and Low Vision People

Merchant, Zain; Anwar, Abrar; Wang, Emily; Chattopadhyay, Souti; Thomason, Jesse

Computer Science > Computation and Language

arXiv:2407.08219 (cs)

[Submitted on 11 Jul 2024]

Title:Generating Contextually-Relevant Navigation Instructions for Blind and Low Vision People

Authors:Zain Merchant, Abrar Anwar, Emily Wang, Souti Chattopadhyay, Jesse Thomason

View PDF HTML (experimental)

Abstract:Navigating unfamiliar environments presents significant challenges for blind and low-vision (BLV) individuals. In this work, we construct a dataset of images and goals across different scenarios such as searching through kitchens or navigating outdoors. We then investigate how grounded instruction generation methods can provide contextually-relevant navigational guidance to users in these instances. Through a sighted user study, we demonstrate that large pretrained language models can produce correct and useful instructions perceived as beneficial for BLV users. We also conduct a survey and interview with 4 BLV users and observe useful insights on preferences for different instructions based on the scenario.

Comments:	Accepted as RO-MAN 2024 Late Breaking Report
Subjects:	Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
Cite as:	arXiv:2407.08219 [cs.CL]
	(or arXiv:2407.08219v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2407.08219

Submission history

From: Abrar Anwar [view email]
[v1] Thu, 11 Jul 2024 06:40:36 UTC (16,162 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2024-07

Change to browse by:

cs
cs.HC

References & Citations

export BibTeX citation

Computer Science > Computation and Language

Title:Generating Contextually-Relevant Navigation Instructions for Blind and Low Vision People

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Generating Contextually-Relevant Navigation Instructions for Blind and Low Vision People

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators