Reinforcement Learning Based Optimal Battery Control Under Cycle-based Degradation Cost

Kwon, Kyung-bin; Zhu, Hao

Mathematics > Optimization and Control

arXiv:2108.02374 (math)

[Submitted on 5 Aug 2021 (v1), last revised 2 Jun 2022 (this version, v3)]

Title:Reinforcement Learning Based Optimal Battery Control Under Cycle-based Degradation Cost

Authors:Kyung-bin Kwon, Hao Zhu

View PDF

Abstract:Battery energy storage systems are providing increasing level of benefits to power grid operations by decreasing the resource uncertainty and supporting frequency regulation. Thus, it is crucial to obtain the optimal policy for battery to efficiently provide these grid-services while accounting for its degradation cost. To solve the optimal battery control (OBC) problem using the powerful reinforcement learning (RL) algorithms, this paper aims to develop a new representation of the cycle-based battery degradation model according to the rainflow algorithm. As the latter depends on the full trajectory, existing work has to rely on linearized approximation for converting it into instantaneous terms for the Markov Decision Process (MDP) based formulation. We propose a new MDP form by introducing additional state variables that can easily keep track of past switching points for determining the cycle depth. The proposed degradation model allows to adopt the powerful deep Q-Network (DQN) based RL algorithm to efficiently search for the OBC policy. Numerical tests using real market data have demonstrated the performance improvements of the proposed cycle-based degradation model in enhancing the battery operations while mitigating its degradation, as compared to earlier work using the linearized approximation.

Subjects:	Optimization and Control (math.OC)
Cite as:	arXiv:2108.02374 [math.OC]
	(or arXiv:2108.02374v3 [math.OC] for this version)
	https://doi.org/10.48550/arXiv.2108.02374

Submission history

From: Kyung-Bin Kwon [view email]
[v1] Thu, 5 Aug 2021 05:12:07 UTC (375 KB)
[v2] Thu, 17 Feb 2022 06:16:46 UTC (340 KB)
[v3] Thu, 2 Jun 2022 18:56:04 UTC (1,183 KB)

Mathematics > Optimization and Control

Title:Reinforcement Learning Based Optimal Battery Control Under Cycle-based Degradation Cost

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Mathematics > Optimization and Control

Title:Reinforcement Learning Based Optimal Battery Control Under Cycle-based Degradation Cost

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators