Explaining Reinforcement Learning with Shapley Values

Beechey, Daniel; Smith, Thomas M. S.; Şimşek, Özgür

arXiv:2306.05810 (cs)

[Submitted on 9 Jun 2023]

Abstract: For reinforcement learning systems to be widely adopted, their users must understand and trust them. We present a theoretical analysis of explaining reinforcement learning using Shapley values, following a principled approach from game theory for identifying the contribution of individual players to the outcome of a cooperative game. We call this general framework Shapley Values for Explaining Reinforcement Learning (SVERL). Our analysis exposes the limitations of earlier uses of Shapley values in reinforcement learning. We then develop an approach that uses Shapley values to explain agent performance. In a variety of domains, SVERL produces meaningful explanations that match and supplement human intuition.
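For readers unfamiliar with the game-theoretic idea the abstract refers to, the following is a minimal sketch of the standard Shapley value computation for a small cooperative game. It is not the SVERL method itself: the abstract does not specify which characteristic function the paper uses, so the game here (`toy_value`, with players `"x"`, `"y"`, `"z"`) is entirely hypothetical. In a reinforcement learning setting, one might imagine the players being state features, but that mapping is an assumption made only for illustration.

```python
from itertools import combinations
from math import factorial


def shapley_values(players, value):
    """Exact Shapley values for a cooperative game.

    players: list of hashable player labels.
    value:   function mapping a frozenset of players to a real payoff.
    Enumerates every coalition, so it is only practical for a few players.
    """
    n = len(players)
    phi = {}
    for i in players:
        others = [p for p in players if p != i]
        total = 0.0
        for k in range(n):  # sizes of coalitions that exclude player i
            # Standard Shapley weight: |S|! (n - |S| - 1)! / n!
            weight = factorial(k) * factorial(n - k - 1) / factorial(n)
            for coalition in combinations(others, k):
                s = frozenset(coalition)
                # Marginal contribution of i when joining coalition s
                total += weight * (value(s | {i}) - value(s))
        phi[i] = total
    return phi


def toy_value(coalition):
    """Hypothetical payoff: x adds 1, y adds 2, x and y together add 1 more."""
    payoff = 0.0
    if "x" in coalition:
        payoff += 1.0
    if "y" in coalition:
        payoff += 2.0
    if {"x", "y"} <= coalition:
        payoff += 1.0  # synergy term shared between x and y
    return payoff


phi = shapley_values(["x", "y", "z"], toy_value)
print(phi)
```

In this toy game the synergy of 1 is split equally between `x` and `y`, the player `z` that never changes the payoff receives zero, and the contributions sum to the payoff of the full coalition (the efficiency property).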

Comments:	12 pages, 9 figures. Accepted at ICML 2023
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2306.05810 [cs.LG]
	(or arXiv:2306.05810v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2306.05810

Submission history

From: Daniel Beechey
[v1] Fri, 9 Jun 2023 10:52:39 UTC (137 KB)
