APS: Active Pretraining with Successor Features

Liu, Hao; Abbeel, Pieter

Computer Science > Machine Learning

arXiv:2108.13956 (cs)

[Submitted on 31 Aug 2021]

Title:APS: Active Pretraining with Successor Features

Authors:Hao Liu, Pieter Abbeel

View PDF

Abstract:We introduce a new unsupervised pretraining objective for reinforcement learning. During the unsupervised reward-free pretraining phase, the agent maximizes mutual information between tasks and states induced by the policy. Our key contribution is a novel lower bound of this intractable quantity. We show that by reinterpreting and combining variational successor features~\citep{Hansen2020Fast} with nonparametric entropy maximization~\citep{liu2021behavior}, the intractable mutual information can be efficiently optimized. The proposed method Active Pretraining with Successor Feature (APS) explores the environment via nonparametric entropy maximization, and the explored data can be efficiently leveraged to learn behavior by variational successor features. APS addresses the limitations of existing mutual information maximization based and entropy maximization based unsupervised RL, and combines the best of both worlds. When evaluated on the Atari 100k data-efficiency benchmark, our approach significantly outperforms previous methods combining unsupervised pretraining with task-specific finetuning.

Comments:	Appeared in ICML 2021
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2108.13956 [cs.LG]
	(or arXiv:2108.13956v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2108.13956

Submission history

From: Hao Liu [view email]
[v1] Tue, 31 Aug 2021 16:30:35 UTC (1,368 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2021-08

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Hao Liu
Pieter Abbeel

export BibTeX citation

Computer Science > Machine Learning

Title:APS: Active Pretraining with Successor Features

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:APS: Active Pretraining with Successor Features

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators