An Actor Critic Method for Free Terminal Time Optimal Control

Burton, Evan; Nakamura-Zimmerer, Tenavi; Gong, Qi; Kang, Wei

Mathematics > Optimization and Control

arXiv:2208.00065 (math)

[Submitted on 29 Jul 2022 (v1), last revised 5 Aug 2022 (this version, v2)]

Title:An Actor Critic Method for Free Terminal Time Optimal Control

Authors:Evan Burton, Tenavi Nakamura-Zimmerer, Qi Gong, Wei Kang

View PDF

Abstract:Optimal control problems with free terminal time present many challenges including nonsmooth and discontinuous control laws, irregular value functions, many local optima, and the curse of dimensionality. To overcome these issues, we propose an adaptation of the model-based actor-critic paradigm from the field of Reinforcement Learning via an exponential transformation to learn an approximate feedback control and value function pair. We demonstrate the algorithm's effectiveness on prototypical examples featuring each of the main pathological issues present in problems of this type.

Subjects:	Optimization and Control (math.OC)
Cite as:	arXiv:2208.00065 [math.OC]
	(or arXiv:2208.00065v2 [math.OC] for this version)
	https://doi.org/10.48550/arXiv.2208.00065

Submission history

From: Evan Burton [view email]
[v1] Fri, 29 Jul 2022 20:39:16 UTC (443 KB)
[v2] Fri, 5 Aug 2022 03:42:04 UTC (443 KB)

Full-text links:

Access Paper:

view license

Current browse context:

math.OC

< prev | next >

new | recent | 2022-08

Change to browse by:

math

References & Citations

export BibTeX citation

Mathematics > Optimization and Control

Title:An Actor Critic Method for Free Terminal Time Optimal Control

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Mathematics > Optimization and Control

Title:An Actor Critic Method for Free Terminal Time Optimal Control

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators