Sample-Efficient Training of Robotic Guide Using Human Path Prediction Network

Moon, Hee-Seung; Seo, Jiwon

Computer Science > Robotics

arXiv:2008.05054v1 (cs)

[Submitted on 12 Aug 2020 (this version), latest version 28 Sep 2022 (v2)]

Title:Sample-Efficient Training of Robotic Guide Using Human Path Prediction Network

Authors:Hee-Seung Moon, Jiwon Seo

View PDF

Abstract:Training a robot that engages with people is challenging, because it is expensive to involve people in a robot training process requiring numerous data samples. This paper proposes a human path prediction network (HPPN) and an evolution strategy-based robot training method using virtual human movements generated by the HPPN, which compensates for this sample inefficiency problem. We applied the proposed method to the training of a robotic guide for visually impaired people, which was designed to collect multimodal human response data and reflect such data when selecting the robot's actions. We collected 1,507 real-world episodes for training the HPPN and then generated over 100,000 virtual episodes for training the robot policy. User test results indicate that our trained robot accurately guides blindfolded participants along a goal path. In addition, by the designed reward to pursue both guidance accuracy and human comfort during the robot policy training process, our robot leads to improved smoothness in human motion while maintaining the accuracy of the guidance. This sample-efficient training method is expected to be widely applicable to all robots and computing machinery that physically interact with humans.

Subjects:	Robotics (cs.RO); Human-Computer Interaction (cs.HC)
Cite as:	arXiv:2008.05054 [cs.RO]
	(or arXiv:2008.05054v1 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2008.05054

Submission history

From: Hee-Seung Moon [view email]
[v1] Wed, 12 Aug 2020 01:15:38 UTC (17,111 KB)
[v2] Wed, 28 Sep 2022 09:06:21 UTC (12,862 KB)

Computer Science > Robotics

Title:Sample-Efficient Training of Robotic Guide Using Human Path Prediction Network

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:Sample-Efficient Training of Robotic Guide Using Human Path Prediction Network

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators