Towards Effective Context for Meta-Reinforcement Learning: an Approach based on Contrastive Learning

Fu, Haotian; Tang, Hongyao; Hao, Jianye; Chen, Chen; Feng, Xidong; Li, Dong; Liu, Wulong

Computer Science > Machine Learning

arXiv:2009.13891 (cs)

[Submitted on 29 Sep 2020 (v1), last revised 15 Dec 2020 (this version, v3)]

Title:Towards Effective Context for Meta-Reinforcement Learning: an Approach based on Contrastive Learning

Authors:Haotian Fu, Hongyao Tang, Jianye Hao, Chen Chen, Xidong Feng, Dong Li, Wulong Liu

View PDF

Abstract:Context, the embedding of previous collected trajectories, is a powerful construct for Meta-Reinforcement Learning (Meta-RL) algorithms. By conditioning on an effective context, Meta-RL policies can easily generalize to new tasks within a few adaptation steps. We argue that improving the quality of context involves answering two questions: 1. How to train a compact and sufficient encoder that can embed the task-specific information contained in prior trajectories? 2. How to collect informative trajectories of which the corresponding context reflects the specification of tasks? To this end, we propose a novel Meta-RL framework called CCM (Contrastive learning augmented Context-based Meta-RL). We first focus on the contrastive nature behind different tasks and leverage it to train a compact and sufficient context encoder. Further, we train a separate exploration policy and theoretically derive a new information-gain-based objective which aims to collect informative trajectories in a few steps. Empirically, we evaluate our approaches on common benchmarks as well as several complex sparse-reward environments. The experimental results show that CCM outperforms state-of-the-art algorithms by addressing previously mentioned problems respectively.

Comments:	Accepted to AAAI 2021
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as:	arXiv:2009.13891 [cs.LG]
	(or arXiv:2009.13891v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2009.13891

Submission history

From: Haotian Fu [view email]
[v1] Tue, 29 Sep 2020 09:29:18 UTC (8,474 KB)
[v2] Wed, 7 Oct 2020 12:10:03 UTC (8,541 KB)
[v3] Tue, 15 Dec 2020 08:48:23 UTC (8,541 KB)

Computer Science > Machine Learning

Title:Towards Effective Context for Meta-Reinforcement Learning: an Approach based on Contrastive Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Towards Effective Context for Meta-Reinforcement Learning: an Approach based on Contrastive Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators