基于深度确定性策略梯度的主网多区域潮流动态协调优化

安静

基于深度确定性策略梯度的主网多区域潮流动态协调优化

安静

Dynamic coordination optimization of multi regional power flow in the main network based on deep deterministic policy gradient

AN Jing

摘要

摘要: 针对主网多区域互联背景下,现有集中式优化方法面临数据处理困难、动态响应滞后,且无法有效解决主网电压越限的问题,开展基于深度确定性策略梯度的主网多区域潮流动态协调优化研究。通过构建可解耦的多区域最优潮流模型与分布式协调框架,为智能体学习奠定基础。通过设计以系统状态、协调动作与安全经济奖励为核心的深度强化学习环境,实现对高维动态特征的提取与智能决策。通过引入半正定松弛技术处理区域内部非凸潮流约束,保障子问题求解的可靠性与效率。通过对比实验证明,所提方法在保障系统安全、提升经济性、加快计算速度及优化资源利用方面均显著优于现有方法,为实现跨区域电网的实时智能协调运行提供了有效方案。

Abstract: In the context of multi regional interconnection in the main network, existing centralized optimization methods face difficulties in data processing, delayed dynamic response, and cannot effectively solve the problem of voltage exceeding the limit of the main network. Therefore, research on dynamic coordination optimization of multi regional power flow in the main network based on deep deterministic policy gradient is carried out. By constructing a decoupled multi region optimal power flow model and a distributed coordination framework, a foundation is laid for intelligent agent learning. By designing a deep reinforcement learning environment with system state, coordinated actions, and safety economic rewards as its core, high-dimensional dynamic feature extraction and intelligent decision-making can be achieved. By introducing the semi positive definite relaxation technique to handle non convex flow constraints within the region, the reliability and efficiency of solving subproblems are ensured. Through comparative experiments, it has been proven that the proposed method is significantly superior to existing methods in ensuring system security, improving economy, accelerating computing speed, and optimizing resource utilization, providing an effective solution for achieving real-time intelligent coordinated operation of cross regional power grids.

HTML全文

参考文献(0)

施引文献