Approximating Euclidean by Imprecise Markov Decision Processes

Jaeger, Manfred; Bacci, Giorgio; Bacci, Giovanni; Larsen, Kim Guldstrand; Jensen, Peter Gjøl

arXiv:2006.14923 (cs)

[Submitted on 26 Jun 2020]

Abstract: Euclidean Markov decision processes are a powerful tool for modeling control problems under uncertainty over continuous domains. Finite-state imprecise Markov decision processes can be used to approximate the behavior of these infinite models. In this paper we address two questions. First, we investigate what kind of approximation guarantees are obtained when the Euclidean process is approximated by finite-state approximations induced by increasingly fine partitions of the continuous state space. We show that for cost functions over finite time horizons the approximations become arbitrarily precise. Second, we use imprecise Markov decision process approximations as a tool to analyse and validate cost functions and strategies obtained by reinforcement learning. We find that, on the one hand, our new theoretical results validate basic design choices of a previously proposed reinforcement learning approach. On the other hand, the imprecise Markov decision process approximations reveal some inaccuracies in the learned cost functions.
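
The idea of bounding a finite-horizon cost through increasingly fine partitions can be illustrated with a minimal sketch. This is not the construction from the paper. It assumes a hypothetical one-dimensional toy system on [0, 1] with a single fixed control, deterministic dynamics x' = a * x, and step cost c(x) = x. Because the toy dynamics are deterministic, the imprecision reduces to a set of possible successor cells rather than intervals of transition probabilities. The names `cost_bounds` and `true_cost` are illustrative only.

```python
import math

def cost_bounds(n_cells, horizon, a=0.5):
    """Per-cell lower/upper bounds on the accumulated cost over `horizon` steps.

    Assumed toy model (not from the paper): state space [0, 1] split into
    n_cells equal cells, dynamics x' = a * x with 0 < a <= 1, step cost c(x) = x.
    """
    width = 1.0 / n_cells
    edges = [i * width for i in range(n_cells + 1)]
    lower = [0.0] * n_cells  # cost bounds for horizon 0
    upper = [0.0] * n_cells
    for _ in range(horizon):
        new_lower, new_upper = [], []
        for i in range(n_cells):
            # Measured in cell widths, the image of cell i is [a*i, a*(i+1)];
            # every cell touching that interval is a possible successor.
            first = max(0, math.floor(a * i))
            last = min(n_cells - 1, math.floor(a * (i + 1)))
            succ = range(first, last + 1)
            # Step cost lies in [edges[i], edges[i+1]]; take the most
            # optimistic / pessimistic successor cell for the remainder.
            new_lower.append(edges[i] + min(lower[j] for j in succ))
            new_upper.append(edges[i + 1] + max(upper[j] for j in succ))
        lower, upper = new_lower, new_upper
    return edges, lower, upper

def true_cost(x, horizon, a=0.5):
    """Exact accumulated cost of the toy system started in state x."""
    total = 0.0
    for _ in range(horizon):
        total += x
        x = a * x
    return total
```

Calling `cost_bounds(8, 5)` and `cost_bounds(64, 5)` and comparing the largest per-cell difference between `upper` and `lower` shows the bounds tightening as the partition is refined while the horizon stays fixed. In this toy model the bound for the cell containing 0 widens by one cell width per step, which gives some intuition for why the guarantee is stated for finite time horizons.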

Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:2006.14923 [cs.AI]
	(or arXiv:2006.14923v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2006.14923

Submission history

From: Manfred Jaeger
[v1] Fri, 26 Jun 2020 11:58:04 UTC (417 KB)
