When to Sense and Control? A Time-adaptive Approach for Continuous-Time RL

Treven, Lenart; Sukhija, Bhavya; As, Yarden; Dörfler, Florian; Krause, Andreas


arXiv:2406.01163 (cs)

[Submitted on 3 Jun 2024 (v1), last revised 4 Jun 2024 (this version, v2)]



Abstract: Reinforcement learning (RL) excels at optimizing policies for discrete-time Markov decision processes (MDPs). However, many systems are inherently continuous in time, making discrete-time MDPs an inexact modeling choice. In many applications, such as greenhouse control or medical treatments, each interaction (a measurement or a switch of action) involves manual intervention and is thus inherently costly. Therefore, we generally prefer a time-adaptive approach with fewer interactions with the system. In this work, we formalize an RL framework, Time-adaptive Control & Sensing (TaCoS), that tackles this challenge by optimizing over policies that, besides the control, also predict the duration of its application. Our formulation results in an extended MDP that any standard RL algorithm can solve. We demonstrate that state-of-the-art RL algorithms trained on TaCoS drastically reduce the number of interactions compared to their discrete-time counterparts while retaining the same or improved performance and exhibiting robustness to the discretization frequency. Finally, we propose OTaCoS, an efficient model-based algorithm for our setting. We show that OTaCoS enjoys sublinear regret for systems with sufficiently smooth dynamics and empirically yields further sample-efficiency gains.
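The idea of a policy that outputs both a control and the duration for which it is applied can be illustrated with a small sketch. This is not the authors' implementation: the class `TimeAdaptiveToyEnv`, the scalar dynamics dx/dt = -x + u, the quadratic reward rate, the fixed per-interaction cost, the duration bounds, and the Euler integration are all assumptions made purely for illustration. The sketch only shows how holding a control for a chosen duration yields an ordinary step-based environment, with the time-to-go appended to the observation.

```python
from dataclasses import dataclass


@dataclass
class TimeAdaptiveToyEnv:
    """Hypothetical toy environment: each action is a (control, duration) pair."""
    horizon: float = 5.0           # episode length (assumed)
    interaction_cost: float = 0.1  # fixed cost per interaction (assumed)
    t_min: float = 0.05            # shortest allowed hold time (assumed)
    t_max: float = 1.0             # longest allowed hold time (assumed)
    dt: float = 1e-3               # Euler step for integrating the toy ODE

    def reset(self, x0=1.0):
        self.x, self.t, self.interactions = x0, 0.0, 0
        # Observation is augmented with the remaining time.
        return (self.x, self.horizon - self.t)

    def step(self, control, duration):
        # Clip the requested duration to the allowed range and the time left.
        remaining = self.horizon - self.t
        duration = min(max(duration, self.t_min), self.t_max, remaining)
        n = max(1, round(duration / self.dt))
        h = duration / n
        reward = 0.0
        for _ in range(n):
            # Integrate the (assumed) reward rate -(x^2 + 0.1 u^2) while the
            # control is held constant, then advance dx/dt = -x + u.
            reward -= (self.x ** 2 + 0.1 * control ** 2) * h
            self.x += (-self.x + control) * h
        self.t += duration
        self.interactions += 1
        reward -= self.interaction_cost  # every interaction is costly
        done = self.t >= self.horizon - 1e-9
        return (self.x, self.horizon - self.t), reward, done


def rollout(env, control, duration):
    """Apply the same (control, duration) action until the episode ends."""
    env.reset()
    total, done = 0.0, False
    while not done:
        _, r, done = env.step(control, duration)
        total += r
    return total, env.interactions


coarse_return, coarse_count = rollout(TimeAdaptiveToyEnv(), 0.0, 1.0)
fine_return, fine_count = rollout(TimeAdaptiveToyEnv(), 0.0, 0.05)
print("interactions:", coarse_count, fine_count)
print("returns:", coarse_return, fine_return)
```

In this toy setup the state decays on its own, so the fine-grained schedule pays many interaction costs without gaining anything. A learned policy would instead choose the duration per step, trading interaction cost against control performance.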

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2406.01163 [cs.LG]
	(or arXiv:2406.01163v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2406.01163

Submission history

From: Lenart Treven
[v1] Mon, 3 Jun 2024 09:57:18 UTC (1,539 KB)
[v2] Tue, 4 Jun 2024 09:06:20 UTC (1,539 KB)
