Learning Invariances for Policy Generalization

Tachet, Remi; Bachman, Philip; van Seijen, Harm

Computer Science > Machine Learning

arXiv:1809.02591 (cs)

[Submitted on 7 Sep 2018 (v1), last revised 12 Dec 2020 (this version, v2)]

Title:Learning Invariances for Policy Generalization

Authors:Remi Tachet, Philip Bachman, Harm van Seijen

View PDF

Abstract:While recent progress has spawned very powerful machine learning systems, those agents remain extremely specialized and fail to transfer the knowledge they gain to similar yet unseen tasks. In this paper, we study a simple reinforcement learning problem and focus on learning policies that encode the proper invariances for generalization to different settings. We evaluate three potential methods for policy generalization: data augmentation, meta-learning and adversarial training. We find our data augmentation method to be effective, and study the potential of meta-learning and adversarial learning as alternative task-agnostic approaches.

Comments:	7 pages, 1 figure
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as:	arXiv:1809.02591 [cs.LG]
	(or arXiv:1809.02591v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1809.02591

Submission history

From: Remi Tachet Des Combes [view email]
[v1] Fri, 7 Sep 2018 17:32:19 UTC (190 KB)
[v2] Sat, 12 Dec 2020 12:57:19 UTC (193 KB)

Computer Science > Machine Learning

Title:Learning Invariances for Policy Generalization

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Learning Invariances for Policy Generalization

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators