Distributionally Robust Reinforcement Learning

Smirnova, Elena; Dohmatob, Elvis; Mary, Jérémie

Statistics > Machine Learning

arXiv:1902.08708v1 (stat)

[Submitted on 23 Feb 2019 (this version), latest version 14 Jun 2019 (v2)]

Title:Distributionally Robust Reinforcement Learning

Authors:Elena Smirnova, Elvis Dohmatob, Jérémie Mary

View PDF

Abstract:Generalization to unknown/uncertain environments of reinforcement learning algorithms is crucial for real-world applications. In this work, we explicitly consider uncertainty associated with the test environment through an uncertainty set. We formulate the Distributionally Robust Reinforcement Learning (DR-RL) objective that consists in maximizing performance against a worst-case policy in uncertainty set centered at the reference policy. Based on this objective, we derive computationally efficient policy improvement algorithm that benefits from Distributionally Robust Optimization (DRO) guarantees. Further, we propose an iterative procedure that increases stability of learning, called Distributionally Robust Policy Iteration. Combined with maximum entropy framework, we derive a distributionally robust variant of Soft Q-learning that enjoys efficient practical implementation and produces policies with robust behaviour at test time. Our formulation provides a unified view on a number of safe RL algorithms and recent empirical successes.

Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as:	arXiv:1902.08708 [stat.ML]
	(or arXiv:1902.08708v1 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.1902.08708

Submission history

From: Elena Smirnova [view email]
[v1] Sat, 23 Feb 2019 00:13:42 UTC (61 KB)
[v2] Fri, 14 Jun 2019 04:58:55 UTC (350 KB)

Statistics > Machine Learning

Title:Distributionally Robust Reinforcement Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:Distributionally Robust Reinforcement Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators