Constrained Reinforcement Learning for Safe Heat Pump Control

Zhang, Baohe; Frison, Lilli; Brox, Thomas; Bödecker, Joschka

Computer Science > Machine Learning

arXiv:2409.19716 (cs)

[Submitted on 29 Sep 2024]

Title:Constrained Reinforcement Learning for Safe Heat Pump Control

Authors:Baohe Zhang, Lilli Frison, Thomas Brox, Joschka Bödecker

View PDF HTML (experimental)

Abstract:Constrained Reinforcement Learning (RL) has emerged as a significant research area within RL, where integrating constraints with rewards is crucial for enhancing safety and performance across diverse control tasks. In the context of heating systems in the buildings, optimizing the energy efficiency while maintaining the residents' thermal comfort can be intuitively formulated as a constrained optimization problem. However, to solve it with RL may require large amount of data. Therefore, an accurate and versatile simulator is favored. In this paper, we propose a novel building simulator I4B which provides interfaces for different usages and apply a model-free constrained RL algorithm named constrained Soft Actor-Critic with Linear Smoothed Log Barrier function (CSAC-LB) to the heating optimization problem. Benchmarking against baseline algorithms demonstrates CSAC-LB's efficiency in data exploration, constraint satisfaction and performance.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Systems and Control (eess.SY)
Cite as:	arXiv:2409.19716 [cs.LG]
	(or arXiv:2409.19716v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2409.19716

Submission history

From: Lilli Frison [view email]
[v1] Sun, 29 Sep 2024 14:15:13 UTC (870 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2024-09

Change to browse by:

cs
cs.AI
cs.SY
eess
eess.SY

References & Citations

export BibTeX citation

Computer Science > Machine Learning

Title:Constrained Reinforcement Learning for Safe Heat Pump Control

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Constrained Reinforcement Learning for Safe Heat Pump Control

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators