Sample-Efficient Training of Robotic Guide Using Human Path Prediction Network

Moon, Hee-Seung; Seo, Jiwon

doi:10.1109/ACCESS.2022.3210932

Computer Science > Robotics

arXiv:2008.05054 (cs)

[Submitted on 12 Aug 2020 (v1), last revised 28 Sep 2022 (this version, v2)]

Title:Sample-Efficient Training of Robotic Guide Using Human Path Prediction Network

Authors:Hee-Seung Moon, Jiwon Seo

View PDF

Abstract:Training a robot that engages with people is challenging; it is expensive to directly involve people in the training process, which requires numerous data samples. This paper presents an alternative approach for resolving this problem. We propose a human path prediction network (HPPN) that generates a user's future trajectory based on sequential robot actions and human responses using a recurrent-neural-network structure. Subsequently, an evolution-strategy-based robot training method using only the virtual human movements generated using the HPPN is presented. It is demonstrated that our proposed method permits sample-efficient training of a robotic guide for visually impaired people. By collecting only 1.5 K episodes from real users, we were able to train the HPPN and generate more than 100 K virtual episodes required for training the robot. The trained robot precisely guided blindfolded participants along a target path. Furthermore, using virtual episodes, we investigated a new reward design that prioritizes human comfort during the robot's guidance without incurring additional costs. This sample-efficient training method is expected to be widely applicable to future robots that interact physically with humans.

Comments:	To be published in IEEE Access
Subjects:	Robotics (cs.RO); Human-Computer Interaction (cs.HC)
Cite as:	arXiv:2008.05054 [cs.RO]
	(or arXiv:2008.05054v2 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2008.05054
Related DOI:	https://doi.org/10.1109/ACCESS.2022.3210932

Submission history

From: Hee-Seung Moon [view email]
[v1] Wed, 12 Aug 2020 01:15:38 UTC (17,111 KB)
[v2] Wed, 28 Sep 2022 09:06:21 UTC (12,862 KB)

Computer Science > Robotics

Title:Sample-Efficient Training of Robotic Guide Using Human Path Prediction Network

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:Sample-Efficient Training of Robotic Guide Using Human Path Prediction Network

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators