Stable Adaptive Control Using New Critic Designs

Werbos, Paul J.

doi:10.1117/12.343068

Adaptation, Noise, and Self-Organizing Systems

arXiv:adap-org/9810001 (adap-org)

[Submitted on 25 Sep 1998 (v1), last revised 20 Nov 2012 (this version, v2)]

Authors: Paul J. Werbos (NSF)

Abstract: Classical adaptive control proves total-system stability for control of linear plants, but only for plants meeting very restrictive assumptions. Approximate Dynamic Programming (ADP) has the potential, in principle, to ensure stability without such tight restrictions. It also offers nonlinear and neural extensions for optimal control, with empirically supported links to what is seen in the brain. However, the relevant ADP methods in use today -- TD, HDP, DHP, GDHP -- and the Galerkin-based versions of these all have serious limitations when used here as parallel distributed real-time learning systems; either they do not possess quadratic unconditional stability (to be defined) or they lead to incorrect results in the stochastic case. (ADAC or Q-learning designs do not help.) After explaining these conclusions, this paper describes new ADP designs which overcome these limitations. It also addresses the Generalized Moving Target problem, a common family of static optimization problems, and describes a way to stabilize large-scale economic equilibrium models, such as the old long-term energy model of DOE.

Comments:	Includes general reviews of alternative control technologies and reinforcement learning. 4 figs, >70p., >200 eqs. Implementation details, stability analysis. Included in 9/24/98 patent disclosure. PDF version uploaded 2012, based on direct conversion of the original Word/HTML file, because of format compatibility issues.
Subjects:	Adaptation and Self-Organizing Systems (nlin.AO)
Cite as:	arXiv:adap-org/9810001
	(or arXiv:adap-org/9810001v2 for this version)
	https://doi.org/10.48550/arXiv.adap-org/9810001
Related DOI:	https://doi.org/10.1117/12.343068

Submission history

From: Dr. Paul J. Werbos
[v1] Fri, 25 Sep 1998 22:09:04 UTC (206 KB)
[v2] Tue, 20 Nov 2012 14:34:12 UTC (1,471 KB)
