Discounted continuous-time constrained Markov decision processes in Polish spaces

Guo, Xianping; Song, Xinyuan

doi:10.1214/10-AAP749

Mathematics > Probability

arXiv:1201.0089 (math)

[Submitted on 30 Dec 2011]

Title:Discounted continuous-time constrained Markov decision processes in Polish spaces

Authors:Xianping Guo, Xinyuan Song

View PDF

Abstract:This paper is devoted to studying constrained continuous-time Markov decision processes (MDPs) in the class of randomized policies depending on state histories. The transition rates may be unbounded, the reward and costs are admitted to be unbounded from above and from below, and the state and action spaces are Polish spaces. The optimality criterion to be maximized is the expected discounted rewards, and the constraints can be imposed on the expected discounted costs. First, we give conditions for the nonexplosion of underlying processes and the finiteness of the expected discounted rewards/costs. Second, using a technique of occupation measures, we prove that the constrained optimality of continuous-time MDPs can be transformed to an equivalent (optimality) problem over a class of probability measures. Based on the equivalent problem and a so-called $\bar{w}$-weak convergence of probability measures developed in this paper, we show the existence of a constrained optimal policy. Third, by providing a linear programming formulation of the equivalent problem, we show the solvability of constrained optimal policies. Finally, we use two computable examples to illustrate our main results.

Comments:	Published in at this http URL the Annals of Applied Probability (this http URL) by the Institute of Mathematical Statistics (this http URL)
Subjects:	Probability (math.PR)
Report number:	IMS-AAP-AAP749
Cite as:	arXiv:1201.0089 [math.PR]
	(or arXiv:1201.0089v1 [math.PR] for this version)
	https://doi.org/10.48550/arXiv.1201.0089
Journal reference:	Annals of Applied Probability 2011, Vol. 21, No. 5, 2016-2049
Related DOI:	https://doi.org/10.1214/10-AAP749

Submission history

From: Xianping Guo [view email] [via VTEX proxy]
[v1] Fri, 30 Dec 2011 09:33:57 UTC (53 KB)

Mathematics > Probability

Title:Discounted continuous-time constrained Markov decision processes in Polish spaces

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Mathematics > Probability

Title:Discounted continuous-time constrained Markov decision processes in Polish spaces

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators