ELSIM: End-to-end learning of reusable skills through intrinsic motivation

Aubret, Arthur; Matignon, Laetitia; Hassas, Salima

Computer Science > Artificial Intelligence

arXiv:2006.12903 (cs)

[Submitted on 23 Jun 2020]

Title:ELSIM: End-to-end learning of reusable skills through intrinsic motivation

Authors:Arthur Aubret, Laetitia Matignon, Salima Hassas

View PDF

Abstract:Taking inspiration from developmental learning, we present a novel reinforcement learning architecture which hierarchically learns and represents self-generated skills in an end-to-end way. With this architecture, an agent focuses only on task-rewarded skills while keeping the learning process of skills bottom-up. This bottom-up approach allows to learn skills that 1- are transferable across tasks, 2- improves exploration when rewards are sparse. To do so, we combine a previously defined mutual information objective with a novel curriculum learning algorithm, creating an unlimited and explorable tree of skills. We test our agent on simple gridworld environments to understand and visualize how the agent distinguishes between its skills. Then we show that our approach can scale on more difficult MuJoCo environments in which our agent is able to build a representation of skills which improve over a baseline both transfer learning and exploration when rewards are sparse.

Comments:	Accepted at ECML 2020
Subjects:	Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2006.12903 [cs.AI]
	(or arXiv:2006.12903v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2006.12903

Submission history

From: Arthur Aubret [view email]
[v1] Tue, 23 Jun 2020 11:20:46 UTC (2,142 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.AI

< prev | next >

new | recent | 2020-06

Change to browse by:

cs
cs.LG

References & Citations

DBLP - CS Bibliography

listing | bibtex

Laëtitia Matignon
Salima Hassas

export BibTeX citation

Computer Science > Artificial Intelligence

Title:ELSIM: End-to-end learning of reusable skills through intrinsic motivation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:ELSIM: End-to-end learning of reusable skills through intrinsic motivation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators