Deep Reinforcement Learning With Macro-Actions

Durugkar, Ishan P.; Rosenbaum, Clemens; Dernbach, Stefan; Mahadevan, Sridhar

Computer Science > Machine Learning

arXiv:1606.04615 (cs)

[Submitted on 15 Jun 2016]

Title:Deep Reinforcement Learning With Macro-Actions

Authors:Ishan P. Durugkar, Clemens Rosenbaum, Stefan Dernbach, Sridhar Mahadevan

View PDF

Abstract:Deep reinforcement learning has been shown to be a powerful framework for learning policies from complex high-dimensional sensory inputs to actions in complex tasks, such as the Atari domain. In this paper, we explore output representation modeling in the form of temporal abstraction to improve convergence and reliability of deep reinforcement learning approaches. We concentrate on macro-actions, and evaluate these on different Atari 2600 games, where we show that they yield significant improvements in learning speed. Additionally, we show that they can even achieve better scores than DQN. We offer analysis and explanation for both convergence and final results, revealing a problem deep RL approaches have with sparse reward signals.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Neural and Evolutionary Computing (cs.NE)
Cite as:	arXiv:1606.04615 [cs.LG]
	(or arXiv:1606.04615v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1606.04615

Submission history

From: Ishan Durugkar [view email]
[v1] Wed, 15 Jun 2016 01:57:40 UTC (525 KB)

Computer Science > Machine Learning

Title:Deep Reinforcement Learning With Macro-Actions

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Deep Reinforcement Learning With Macro-Actions

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators