Enhancing power grid resilience through weather-aware security constraints: A deep reinforcement learning approach with hybrid CNN-GRU architecture

Wu, Yingjun, Feng, Junyu, Chen, Xuejie, Ye, Yujian, Lin, Zhiwei, Yuan, Jiangfan, He, Xueyan, Yin, Zhengxi and Lu, Jiayan 2026. Enhancing power grid resilience through weather-aware security constraints: A deep reinforcement learning approach with hybrid CNN-GRU architecture. Applied Energy 407 , 127363. 10.1016/j.apenergy.2026.127363

[thumbnail of Full-text-Enhancing_Power_Grid_Resilience.pdf]

Preview

PDF - Accepted Post-Print Version
Available under License Creative Commons Attribution.
Download (1MB) | Preview

Official URL: https://doi.org/10.1016/j.apenergy.2026.127363

Abstract

Extreme weather events increasingly challenge the operational resilience of distribution systems by introducing dynamic and uncertain security limits (SLs), alongside data sparsity. Traditional model-based approaches often rely on static assumptions and require complete system modeling, making them difficult to adapt to rapidly evolving weather-induced constraints. To address these limitations, this paper proposes a model-free resilience enhancement framework based on deep reinforcement learning (DRL), integrating real-time weather-aware SL identification and adaptive dispatch. First, an ensemble Bagging-XGBoost model is developed to classify weather severity levels and determine whether static or dynamic SLs should be applied, enabling scenario-adaptive SL switching. Second, a hybrid convolutional neural network–gated recurrent unit (CNN-GRU) model, enhanced by transfer learning, is designed to accurately estimate dynamic SLs under varying weather conditions. The CNN captures spatial meteorological patterns, while the GRU models temporal evolution; transfer learning improves generalization under limited training data. Third, the dispatch problem is formulated as a constrained Markov decision process (CMDP), and solved using a primal–dual deep deterministic policy gradient (PD-DDPG) algorithm that explicitly incorporates SL constraints into the policy learning process. An attention-based meteorological data reconstruction model is further integrated to enhance the quality of input data and training efficiency. Case studies on the improved IEEE-123 test feeder demonstrate that the proposed method reduces average load loss by 23.30 % and 12.10 % compared to CNN-only and GRU-only baselines, respectively. Moreover, it achieves an 88.77 % improvement in computational efficiency over conventional model-based resilience strategies, highlighting its robustness and applicability under limited data and high-impact weather conditions.

Item Type:	Article
Date Type:	Publication
Status:	Published
Schools:	Schools > Engineering
Additional Information:	Rights Retention applied.
Publisher:	Elsevier BV
ISSN:	0306-2619
Date of First Compliant Deposit:	20 January 2026
Date of Acceptance:	5 January 2026
Last Modified:	20 Jan 2026 16:30
URI:	https://orca.cardiff.ac.uk/id/eprint/184021

Actions (repository staff only)

Edit Item

Altmetric

Dimensions

Download Statistics

Downloads

Downloads per month over past year

View more statistics

CORE (COnnecting REpositories)