The Role of Diverse Replay for Generalisation in Reinforcement Learning

Weltevrede, Max; Spaan, Matthijs T. J.; Böhmer, Wendelin

arXiv:2306.05727 (cs)

[Submitted on 9 Jun 2023 (v1), last revised 31 Aug 2023 (this version, v2)]

Abstract: In reinforcement learning (RL), key components of many algorithms are the exploration strategy and replay buffer. These strategies regulate what environment data is collected and trained on and have been extensively studied in the RL literature. In this paper, we investigate the impact of these components in the context of generalisation in multi-task RL. We investigate the hypothesis that collecting and training on more diverse data from the training environments will improve zero-shot generalisation to new tasks. We motivate mathematically and show empirically that generalisation to tasks that are "reachable" during training is improved by increasing the diversity of transitions in the replay buffer. Furthermore, we show empirically that this same strategy also shows improvement for generalisation to similar but "unreachable" tasks, which could be due to improved generalisation of the learned latent representations.

Comments:	15 pages, 8 figures
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2306.05727 [cs.LG]
	(or arXiv:2306.05727v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2306.05727

Submission history

From: Max Weltevrede
[v1] Fri, 9 Jun 2023 07:48:36 UTC (296 KB)
[v2] Thu, 31 Aug 2023 10:54:50 UTC (309 KB)
