Towards sample-efficient episodic control with DAC-ML

Freire, Ismael T.; Amil, Adrián F.; Vouloutsi, Vasiliki; Verschure, Paul F. M. J.

arXiv:2012.13779 (cs)

[Submitted on 26 Dec 2020]

Abstract: The sample-inefficiency problem in Artificial Intelligence refers to the inability of current Deep Reinforcement Learning models to optimize action policies within a small number of episodes. Recent studies have tried to overcome this limitation by adding memory systems and architectural biases to improve learning speed, such as in Episodic Reinforcement Learning. However, despite achieving incremental improvements, their performance is still not comparable to how humans learn behavioral policies. In this paper, we capitalize on the design principles of the Distributed Adaptive Control (DAC) theory of mind and brain to build a novel cognitive architecture (DAC-ML) that, by incorporating a hippocampus-inspired sequential memory system, can rapidly converge to effective action policies that maximize reward acquisition in a challenging foraging task.

Comments:	8 pages, 3 figures
Subjects:	Artificial Intelligence (cs.AI); Neurons and Cognition (q-bio.NC); Machine Learning (stat.ML)
Cite as:	arXiv:2012.13779 [cs.AI]
	(or arXiv:2012.13779v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2012.13779

Submission history

From: Ismael Tito Freire González
[v1] Sat, 26 Dec 2020 16:38:08 UTC (5,990 KB)
