The Termination Critic

Harutyunyan, Anna; Dabney, Will; Borsa, Diana; Heess, Nicolas; Munos, Remi; Precup, Doina

arXiv:1902.09996 (cs)

[Submitted on 26 Feb 2019]

Abstract: In this work, we consider the problem of autonomously discovering behavioral abstractions, or options, for reinforcement learning agents. We propose an algorithm that focuses on the termination condition, as opposed to -- as is common -- the policy. The termination condition is usually trained to optimize a control objective: an option ought to terminate if another has better value. We offer a different, information-theoretic perspective, and propose that terminations should focus instead on the compressibility of the option's encoding -- arguably a key reason for using abstractions. To achieve this algorithmically, we leverage the classical options framework, and learn the option transition model as a "critic" for the termination condition. Using this model, we derive gradients that optimize the desired criteria. We show that the resulting options are non-trivial, intuitively meaningful, and useful for learning and planning.
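
To make the abstract's terms concrete, the following is a minimal sketch and not the paper's algorithm. It uses the classical options convention in which the option acts and then terminates with probability beta(s') on arriving in state s'. It computes the option transition model for a toy problem, meaning the distribution over states in which the option ends, given a start state. Reading "compressibility" as the entropy of that distribution is an assumption made here for illustration. The 5-state chain, the deterministic "move right" policy, and the names `final_state_distribution` and `entropy_bits` are all hypothetical.

```python
import math


def final_state_distribution(start, next_state, beta, max_steps=1000):
    """Distribution over the states in which an option terminates.

    next_state: dict mapping a state to its successor under the option's
                (here deterministic) policy.
    beta:       dict mapping a state to the probability of terminating
                on arrival in that state.
    """
    alive = {start: 1.0}  # probability mass still executing the option
    final = {}            # probability mass that has terminated, by state
    for _ in range(max_steps):
        if not alive:
            break
        new_alive = {}
        for s, p in alive.items():
            s2 = next_state[s]
            stop = p * beta[s2]
            if stop > 0.0:
                final[s2] = final.get(s2, 0.0) + stop
            cont = p - stop
            if cont > 1e-12:  # drop negligible leftover mass
                new_alive[s2] = new_alive.get(s2, 0.0) + cont
        alive = new_alive
    return final


def entropy_bits(dist):
    """Shannon entropy of a distribution given as {outcome: probability}."""
    return -sum(p * math.log2(p) for p in dist.values() if p > 0.0)


# Toy chain 0 -> 1 -> 2 -> 3 -> 4, with state 4 absorbing (assumed example).
chain = {s: min(s + 1, 4) for s in range(5)}

# Termination that fires only at the end of the chain.
beta_sharp = {0: 0.0, 1: 0.0, 2: 0.0, 3: 0.0, 4: 1.0}
# Termination that fires with probability 0.5 in every state.
beta_diffuse = {s: 0.5 for s in range(5)}

sharp = final_state_distribution(0, chain, beta_sharp)
diffuse = final_state_distribution(0, chain, beta_diffuse)

print("sharp:  ", sharp, entropy_bits(sharp))
print("diffuse:", diffuse, entropy_bits(diffuse))
```

In this toy case the sharp termination ends in a single state, so its final-state entropy is zero, while the uniform 0.5 termination spreads the ending over four states and has an entropy of about 1.75 bits. Under the entropy reading assumed above, the first option would count as the more compressible one.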

Comments:	AISTATS 2019
Subjects:	Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1902.09996 [cs.AI]
	(or arXiv:1902.09996v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.1902.09996

Submission history

From: Anna Harutyunyan
[v1] Tue, 26 Feb 2019 15:26:10 UTC (5,712 KB)
