A General Family of Robust Stochastic Operators for Reinforcement Learning

Lu, Yingdong; Squillante, Mark S.; Wu, Chai Wah

Statistics > Machine Learning

arXiv:1805.08122 (stat)

[Submitted on 21 May 2018 (v1), last revised 28 May 2019 (this version, v2)]

Title:A General Family of Robust Stochastic Operators for Reinforcement Learning

Authors:Yingdong Lu, Mark S. Squillante, Chai Wah Wu

View PDF

Abstract:We consider a new family of operators for reinforcement learning with the goal of alleviating the negative effects and becoming more robust to approximation or estimation errors. Various theoretical results are established, which include showing on a sample path basis that our family of operators preserve optimality and increase the action gap. Our empirical results illustrate the strong benefits of our family of operators, significantly outperforming the classical Bellman operator and recently proposed operators.

Comments:	12 pages
Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as:	arXiv:1805.08122 [stat.ML]
	(or arXiv:1805.08122v2 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.1805.08122

Submission history

From: Yingdong Lu [view email]
[v1] Mon, 21 May 2018 15:30:54 UTC (728 KB)
[v2] Tue, 28 May 2019 17:15:42 UTC (374 KB)

Full-text links:

Access Paper:

view license

Current browse context:

stat.ML

< prev | next >

new | recent | 2018-05

Change to browse by:

cs
cs.LG
stat

References & Citations

export BibTeX citation

Statistics > Machine Learning

Title:A General Family of Robust Stochastic Operators for Reinforcement Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:A General Family of Robust Stochastic Operators for Reinforcement Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators