Episodic Memory Deep Q-Networks

Lin, Zichuan; Zhao, Tianqi; Yang, Guangwen; Zhang, Lintao

arXiv:1805.07603 (cs)

[Submitted on 19 May 2018]

Abstract: Reinforcement learning (RL) algorithms have made huge progress in recent years by leveraging the power of deep neural networks (DNNs). Despite this success, deep RL algorithms are known to be sample inefficient, often requiring many rounds of interaction with the environment to obtain satisfactory performance. Recently, episodic memory based RL has attracted attention due to its ability to latch onto good actions quickly. In this paper, we present a simple yet effective biologically inspired RL algorithm called Episodic Memory Deep Q-Networks (EMDQN), which leverages episodic memory to supervise an agent during training. Experiments show that our proposed method leads to better sample efficiency and is more likely to find good policies. It requires only 1/5 of the interactions of DQN to achieve state-of-the-art performance on many Atari games, significantly outperforming regular DQN and other episodic memory based RL algorithms.

Comments:	Accepted by IJCAI 2018
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as:	arXiv:1805.07603 [cs.LG]
	(or arXiv:1805.07603v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1805.07603

Submission history

From: Zichuan Lin
[v1] Sat, 19 May 2018 14:33:00 UTC (833 KB)
