A General Framework for Bounding Approximate Dynamic Programming Schemes

Liu, Yajing; Chong, Edwin; Pezeshki, Ali; Zhang, Zhenliang

doi:10.1109/LCSYS.2020.3003477

arXiv:1809.05249 (math)

[Submitted on 14 Sep 2018 (v1), last revised 16 Jun 2020 (this version, v5)]

Abstract: For years, there has been interest in approximation methods for solving dynamic programming problems, because of the inherent complexity in computing optimal solutions characterized by Bellman's principle of optimality. A wide range of approximate dynamic programming (ADP) methods now exists. It is of great interest to guarantee that the performance of an ADP scheme be at least some known fraction, say $\beta$, of optimal. This paper introduces a general approach to bounding the performance of ADP methods, in this sense, in the stochastic setting. The approach is based on new results for bounding greedy solutions in string optimization problems, where one has to choose a string (ordered set) of actions to maximize an objective function. This bounding technique is inspired by submodularity theory, but submodularity is not required for establishing bounds. Instead, the bounding is based on quantifying certain notions of curvature of string functions; the smaller the curvatures, the better the bound. The key insight is that any ADP scheme is a greedy scheme for some surrogate string objective function that coincides in its optimal solution and value with those of the original optimal control problem. The ADP scheme then yields to the bounding technique mentioned above, and the curvatures of the surrogate objective determine the value $\beta$ of the bound. The surrogate objective and its curvatures depend on the specific ADP scheme.
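To make the notion of a greedy solution to a string optimization problem concrete, the following is a minimal sketch on a toy problem. Everything in it is an assumption made for illustration and is not taken from the paper: the coverage data `COVER`, the stage weights `STAGE_WEIGHT`, the objective `f`, and the helper names `greedy_string` and `optimal_string`. It compares the greedy string with a brute-force optimal string and reports the achieved fraction of optimal. It does not compute the paper's curvatures or its bound $\beta$; it only shows the kind of ratio such a bound would guarantee from below.

```python
from itertools import product

# Hypothetical toy data: each action covers a set of targets.
COVER = {"a": {1, 2, 3, 4}, "b": {1, 2, 5}, "c": {3, 4, 6}}
ACTIONS = ("a", "b", "c")
# Hypothetical stage weights; they make the objective depend on action order.
STAGE_WEIGHT = (1, 2)
HORIZON = 2


def f(string):
    """Toy string objective: weighted count of newly covered targets per stage."""
    covered, total = set(), 0
    for stage, action in enumerate(string):
        new = COVER[action] - covered
        total += STAGE_WEIGHT[stage] * len(new)
        covered |= new
    return total


def greedy_string(objective, actions, horizon):
    """Build a string by appending, at each stage, the action that maximizes
    the objective of the extended string."""
    string = ()
    for _ in range(horizon):
        best = max(actions, key=lambda a: objective(string + (a,)))
        string += (best,)
    return string


def optimal_string(objective, actions, horizon):
    """Exhaustive search over all strings of the given length (small cases only)."""
    return max(product(actions, repeat=horizon), key=objective)


greedy = greedy_string(f, ACTIONS, HORIZON)
optimal = optimal_string(f, ACTIONS, HORIZON)
ratio = f(greedy) / f(optimal)
print(greedy, f(greedy), optimal, f(optimal), round(ratio, 3))
```

In this toy instance the greedy string is not optimal, so the ratio is strictly below 1. Exhaustive search is used only because the example is tiny; the point of a performance bound is to certify such a ratio without enumerating all strings.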

Subjects:	Optimization and Control (math.OC)
MSC classes:	control and optimization
Cite as:	arXiv:1809.05249 [math.OC]
	(or arXiv:1809.05249v5 [math.OC] for this version)
	https://doi.org/10.48550/arXiv.1809.05249
Related DOI:	https://doi.org/10.1109/LCSYS.2020.3003477

Submission history

From: Yajing Liu
[v1] Fri, 14 Sep 2018 04:15:02 UTC (67 KB)
[v2] Sun, 17 Mar 2019 01:49:54 UTC (68 KB)
[v3] Tue, 26 Mar 2019 16:30:45 UTC (68 KB)
[v4] Thu, 28 Mar 2019 15:47:29 UTC (68 KB)
[v5] Tue, 16 Jun 2020 20:11:00 UTC (82 KB)
