Risk-Averse Control of Markov Systems with Value Function Learning

Ruszczynski, Andrzej; Yang, Shangzhe

Mathematics > Optimization and Control

arXiv:2312.00946 (math)

[Submitted on 1 Dec 2023]

Title:Risk-Averse Control of Markov Systems with Value Function Learning

Authors:Andrzej Ruszczynski, Shangzhe Yang

View PDF

Abstract:We consider a control problem for a finite-state Markov system whose performance is evaluated by a coherent Markov risk measure. For each policy, the risk of a state is approximated by a function of its features, thus leading to a lower-dimensional policy evaluation problem, which involves non-differentiable stochastic operators. We introduce mini-batch transition risk mappings, which are particularly suited to our approach, and we use them to derive a robust learning algorithm for Markov policy evaluation. Finally, we discuss structured policy improvement in the feature-based risk-averse setting. The considerations are illustrated with an underwater robot navigation problem in which several waypoints must be visited and the observation results must be reported from selected transmission locations. We identify the relevant features, we test the simulation-based learning method, and we optimize a structured policy in a hyperspace containing all problems with the same number of relevant points.

Subjects:	Optimization and Control (math.OC)
Cite as:	arXiv:2312.00946 [math.OC]
	(or arXiv:2312.00946v1 [math.OC] for this version)
	https://doi.org/10.48550/arXiv.2312.00946

Submission history

From: Andrzej Ruszczyński [view email]
[v1] Fri, 1 Dec 2023 21:58:49 UTC (139 KB)

Mathematics > Optimization and Control

Title:Risk-Averse Control of Markov Systems with Value Function Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Mathematics > Optimization and Control

Title:Risk-Averse Control of Markov Systems with Value Function Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators