A Unified Framework for Reinforcement Learning

Shi, Jicheng; Lian, Yingzhao; Jones, Colin N.

Electrical Engineering and Systems Science > Systems and Control

arXiv:2002.06883v2 (eess)

[Submitted on 17 Feb 2020 (v1), revised 23 Feb 2020 (this version, v2), latest version 1 Jun 2020 (v3)]

Title:A Unified Framework for Reinforcement Learning

Authors:Jicheng Shi, Yingzhao Lian, Colin N. Jones

View PDF

Abstract:Reinforcement learning has shown strong potential in learning optimal control strategy by modelling policy and/or value function. Even though policy and value function forms duality regarding the Bellman equation, there is no structure unifies this two branches. In this paper, we propose to use an convex optimization layer to combine these two branches which enables universal compatibility with all reinforcement learning algorithm without modification of the model structure. Design and training issues will be explained and validated by both linear and nonlinear control.

Subjects:	Systems and Control (eess.SY)
Cite as:	arXiv:2002.06883 [eess.SY]
	(or arXiv:2002.06883v2 [eess.SY] for this version)
	https://doi.org/10.48550/arXiv.2002.06883

Submission history

From: Jicheng Shi [view email]
[v1] Mon, 17 Feb 2020 10:58:42 UTC (363 KB)
[v2] Sun, 23 Feb 2020 22:03:54 UTC (601 KB)
[v3] Mon, 1 Jun 2020 15:52:52 UTC (684 KB)

Full-text links:

Access Paper:

view license

Current browse context:

eess.SY

< prev | next >

new | recent | 2020-02

Change to browse by:

cs
cs.SY
eess

References & Citations

export BibTeX citation

Electrical Engineering and Systems Science > Systems and Control

Title:A Unified Framework for Reinforcement Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Systems and Control

Title:A Unified Framework for Reinforcement Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators