Full text

Turn on search term navigation

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

The hybrid energy storage system (HESS) that combines battery with hydrogen storage exploits complementary power/energy characteristics, but most studies optimize capacity and operation separately, leading to suboptimal overall performance. To address this issue, this paper proposes a bi-level co-optimization framework that integrates deep reinforcement learning (DRL) and mixed integer programming (MIP). The outer layer employs the TD3 algorithm for capacity configuration, while the inner layer uses the Gurobi solver for optimal operation under constraints. On a standalone PV–wind–load-HESS system, the method attains near-optimal quality at dramatically lower runtime. Relative to GA + Gurobi and PSO + Gurobi, the cost is lower by 4.67% and 1.31%, while requiring only 0.52% and 0.58% of their runtime; compared with a direct Gurobi solve, the cost remains comparable while runtime decreases to 0.07%. Sensitivity analysis further validates the model’s robustness under various cost parameters and renewable energy penetration levels. These results indicate that the proposed DRL–MIP cooperation achieves near-optimal solutions with orders of magnitude speedups. This study provides a new DRL–MIP paradigm for efficiently solving strongly coupled bi-level optimization problems in energy systems.

Details

Title
Co-Optimization of Capacity and Operation for Battery-Hydrogen Hybrid Energy Storage Systems Based on Deep Reinforcement Learning and Mixed Integer Programming
Author
Qian Tiantian 1 ; Zhang Kaifeng 2 ; Shi Difen 3 ; Zhang, Lei 3 

 School of Electronic Engineering, Nanjing Xiaozhuang University, Nanjing 211171, China; [email protected] (D.S.); [email protected] (L.Z.), School of Automation, Southeast University, Nanjing 210096, China; [email protected] 
 School of Automation, Southeast University, Nanjing 210096, China; [email protected] 
 School of Electronic Engineering, Nanjing Xiaozhuang University, Nanjing 211171, China; [email protected] (D.S.); [email protected] (L.Z.) 
First page
5638
Publication year
2025
Publication date
2025
Publisher
MDPI AG
e-ISSN
19961073
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3271027226
Copyright
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.