Evaluation of Human-AI Teams for Learned and Rule-Based Agents in Hanabi

Siu, Ho Chit; Pena, Jaime D.; Chang, Kimberlee C.; Chen, Edenna; Zhou, Yutai; Lopez, Victor J.; Palko, Kyle; Allen, Ross E.

arXiv:2107.07630v2 (cs)

[Submitted on 15 Jul 2021 (v1), revised 20 Jul 2021 (this version, v2), latest version 21 Oct 2021 (v3)]

Abstract: Deep reinforcement learning has generated superhuman AI in competitive games such as Go and StarCraft. Can similar learning techniques create a superior AI teammate for human-machine collaborative games? Will humans prefer AI teammates that improve objective team performance or those that improve subjective metrics of trust? In this study, we perform a single-blind evaluation of teams of humans and AI agents in the cooperative card game Hanabi, with both rule-based and learning-based agents. In addition to the game score, used as an objective metric of the human-AI team performance, we also quantify subjective measures of the human's perceived performance, teamwork, interpretability, trust, and overall preference of AI teammate. We find that humans have a clear preference toward a rule-based AI teammate (SmartBot) over a state-of-the-art learning-based AI teammate (Other-Play) across nearly all subjective metrics, and generally view the learning-based agent negatively, despite no statistical difference in the game score. This result has implications for future AI design and reinforcement learning benchmarking, highlighting the need to incorporate subjective metrics of human-AI teaming rather than a singular focus on objective task performance.

Subjects:	Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
Cite as:	arXiv:2107.07630 [cs.AI]
	(or arXiv:2107.07630v2 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2107.07630

Submission history

From: Ho Chit Siu
[v1] Thu, 15 Jul 2021 22:19:15 UTC (15,458 KB)
[v2] Tue, 20 Jul 2021 03:15:47 UTC (15,459 KB)
[v3] Thu, 21 Oct 2021 18:20:15 UTC (14,535 KB)
