A Bayesian Account of Measures of Interpretability in Human-AI Interaction

Sreedharan, Sarath; Kulkarni, Anagha; Chakraborti, Tathagata; Smith, David E.; Kambhampati, Subbarao

Computer Science > Artificial Intelligence

arXiv:2011.10920 (cs)

[Submitted on 22 Nov 2020]

Title:A Bayesian Account of Measures of Interpretability in Human-AI Interaction

Authors:Sarath Sreedharan, Anagha Kulkarni, Tathagata Chakraborti, David E. Smith, Subbarao Kambhampati

View PDF

Abstract:Existing approaches for the design of interpretable agent behavior consider different measures of interpretability in isolation. In this paper we posit that, in the design and deployment of human-aware agents in the real world, notions of interpretability are just some among many considerations; and the techniques developed in isolation lack two key properties to be useful when considered together: they need to be able to 1) deal with their mutually competing properties; and 2) an open world where the human is not just there to interpret behavior in one specific form. To this end, we consider three well-known instances of interpretable behavior studied in existing literature -- namely, explicability, legibility, and predictability -- and propose a revised model where all these behaviors can be meaningfully modeled together. We will highlight interesting consequences of this unified model and motivate, through results of a user study, why this revision is necessary.

Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:2011.10920 [cs.AI]
	(or arXiv:2011.10920v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2011.10920

Submission history

From: Sarath Sreedharan [view email]
[v1] Sun, 22 Nov 2020 03:28:28 UTC (1,652 KB)

Computer Science > Artificial Intelligence

Title:A Bayesian Account of Measures of Interpretability in Human-AI Interaction

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:A Bayesian Account of Measures of Interpretability in Human-AI Interaction

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators