Towards an Unified Structure for Reinforcement Learning: an Optimization Approach

Shi, Jicheng; Lian, Yingzhao; Jones, Colin N.

Electrical Engineering and Systems Science > Systems and Control

arXiv:2002.06883 (eess)

[Submitted on 17 Feb 2020 (v1), last revised 1 Jun 2020 (this version, v3)]

Title:Towards an Unified Structure for Reinforcement Learning: an Optimization Approach

Authors:Jicheng Shi, Yingzhao Lian, Colin N. Jones

View PDF

Abstract:Both the optimal value function and the optimal policy can be used to model an optimal controller based on the duality established by the Bellman equation. Even with this duality, no parametric model has been able to output both policy and value function with a common parameter set. In this paper, a unified structure is proposed with a parametric optimization problem. The policy and the value function modelled by this structure share all parameters, which enables seamless switching among reinforcement learning algorithms while continuing to learn. The Q-learning and policy gradient based on the proposed structure is detailed. An actor-critic algorithm based on this structure, whose actor and critic are both modelled by the same parameters, is validated by both linear and nonlinear control.

Subjects:	Systems and Control (eess.SY)
Cite as:	arXiv:2002.06883 [eess.SY]
	(or arXiv:2002.06883v3 [eess.SY] for this version)
	https://doi.org/10.48550/arXiv.2002.06883

Submission history

From: Jicheng Shi [view email]
[v1] Mon, 17 Feb 2020 10:58:42 UTC (363 KB)
[v2] Sun, 23 Feb 2020 22:03:54 UTC (601 KB)
[v3] Mon, 1 Jun 2020 15:52:52 UTC (684 KB)

Electrical Engineering and Systems Science > Systems and Control

Title:Towards an Unified Structure for Reinforcement Learning: an Optimization Approach

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Systems and Control

Title:Towards an Unified Structure for Reinforcement Learning: an Optimization Approach

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators