Convex Is Back: Solving Belief MDPs With Convexity-Informed Deep Reinforcement Learning

Koutas, Daniel; Hettegger, Daniel; Papakonstantinou, Kostas G.; Straub, Daniel

Computer Science > Machine Learning

arXiv:2502.09298 (cs)

[Submitted on 13 Feb 2025]

Title:Convex Is Back: Solving Belief MDPs With Convexity-Informed Deep Reinforcement Learning

Authors:Daniel Koutas, Daniel Hettegger, Kostas G. Papakonstantinou, Daniel Straub

View PDF HTML (experimental)

Abstract:We present a novel method for Deep Reinforcement Learning (DRL), incorporating the convex property of the value function over the belief space in Partially Observable Markov Decision Processes (POMDPs). We introduce hard- and soft-enforced convexity as two different approaches, and compare their performance against standard DRL on two well-known POMDP environments, namely the Tiger and FieldVisionRockSample problems. Our findings show that including the convexity feature can substantially increase performance of the agents, as well as increase robustness over the hyperparameter space, especially when testing on out-of-distribution domains. The source code for this work can be found at this https URL.

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2502.09298 [cs.LG]
	(or arXiv:2502.09298v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2502.09298

Submission history

From: Daniel Koutas [view email]
[v1] Thu, 13 Feb 2025 13:12:16 UTC (5,304 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2025-02

Change to browse by:

References & Citations

export BibTeX citation

Computer Science > Machine Learning

Title:Convex Is Back: Solving Belief MDPs With Convexity-Informed Deep Reinforcement Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Convex Is Back: Solving Belief MDPs With Convexity-Informed Deep Reinforcement Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators