Risk-Averse Biased Human Policies in Assistive Multi-Armed Bandit Settings

Koller, Michael; Patten, Timothy; Vincze, Markus

Computer Science > Robotics

arXiv:2104.05334 (cs)

[Submitted on 12 Apr 2021]

Title:Risk-Averse Biased Human Policies in Assistive Multi-Armed Bandit Settings

Authors:Michael Koller, Timothy Patten, Markus Vincze

View PDF

Abstract:Assistive multi-armed bandit problems can be used to model team situations between a human and an autonomous system like a domestic service robot. To account for human biases such as the risk-aversion described in the Cumulative Prospect Theory, the setting is expanded to using observable rewards. When robots leverage knowledge about the risk-averse human model they eliminate the bias and make more rational choices. We present an algorithm that increases the utility value of such human-robot teams. A brief evaluation indicates that arbitrary reward functions can be handled.

Comments:	in TRAITS Workshop Proceedings (arXiv:2103.12679) held in conjunction with Companion of the 2021 ACM/IEEE International Conference on Human-Robot Interaction, March 2021, Pages 709-711
Subjects:	Robotics (cs.RO)
Report number:	TRAITS/2021/10
Cite as:	arXiv:2104.05334 [cs.RO]
	(or arXiv:2104.05334v1 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2104.05334

Submission history

From: Michael Koller [view email]
[v1] Mon, 12 Apr 2021 10:26:13 UTC (72 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.RO

< prev | next >

new | recent | 2021-04

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Michael Koller
Timothy Patten
Markus Vincze

export BibTeX citation

Computer Science > Robotics

Title:Risk-Averse Biased Human Policies in Assistive Multi-Armed Bandit Settings

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:Risk-Averse Biased Human Policies in Assistive Multi-Armed Bandit Settings

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators