Superstition in the Network: Deep Reinforcement Learning Plays Deceptive Games

Bontrager, Philip; Khalifa, Ahmed; Anderson, Damien; Stephenson, Matthew; Salge, Christoph; Togelius, Julian

Computer Science > Machine Learning

arXiv:1908.04436 (cs)

[Submitted on 12 Aug 2019]

Title:Superstition in the Network: Deep Reinforcement Learning Plays Deceptive Games

Authors:Philip Bontrager, Ahmed Khalifa, Damien Anderson, Matthew Stephenson, Christoph Salge, Julian Togelius

View PDF

Abstract:Deep reinforcement learning has learned to play many games well, but failed on others. To better characterize the modes and reasons of failure of deep reinforcement learners, we test the widely used Asynchronous Actor-Critic (A2C) algorithm on four deceptive games, which are specially designed to provide challenges to game-playing agents. These games are implemented in the General Video Game AI framework, which allows us to compare the behavior of reinforcement learning-based agents with planning agents based on tree search. We find that several of these games reliably deceive deep reinforcement learners, and that the resulting behavior highlights the shortcomings of the learning algorithm. The particular ways in which agents fail differ from how planning-based agents fail, further illuminating the character of these algorithms. We propose an initial typology of deceptions which could help us better understand pitfalls and failure modes of (deep) reinforcement learning.

Comments:	7 pages, 4 figures, Accepted at the 15th AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment (AIIDE 19)
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as:	arXiv:1908.04436 [cs.LG]
	(or arXiv:1908.04436v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1908.04436

Submission history

From: Philip Bontrager [view email]
[v1] Mon, 12 Aug 2019 23:27:26 UTC (1,144 KB)

Computer Science > Machine Learning

Title:Superstition in the Network: Deep Reinforcement Learning Plays Deceptive Games

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Superstition in the Network: Deep Reinforcement Learning Plays Deceptive Games

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators