基于SAC-PSO分层协同强化学习的微网群优化调度策略
Hierarchical Collaborative Microgrid Cluster Optimal Scheduling Strategy Based on SAC-PSO
-
摘要: 针对微网群优化调度中全局协调困难、局部约束复杂及风光出力不确定性强等问题,提出一种基于改进软演员—评论家(SAC)与改进粒子群优化(PSO)的分层协同调度方法。构建由微网群聚合商和多个子微网组成的双层优化模型,其中上层负责微网群整体功率协调,下层负责子微网内部自治优化。针对上层高维连续决策问题,引入变分自编码器(VAE)进行状态压缩,并结合时间衰减优先经验回放机制改进SAC算法,以提升学习效率与环境适应能力;针对下层多约束、非线性优化问题,设计包含自适应惯性权重、精英领导者选择及约束修复机制的改进PSO算法,以增强复杂可行域下的求解能力。Abstract: To address the challenges in the optimal scheduling of microgrid clusters, including difficulties in global coordination, complex local constraints, and high uncertainty in wind and photovoltaic power outputs, a hierarchical cooperative scheduling method based on an improved soft actor–critic (SAC) algorithm and an improved particle swarm optimization (PSO) algorithm is proposed. A two-layer optimization model is constructed, consisting of a microgrid cluster aggregator at the upper layer and multiple sub-microgrids at the lower layer. The upper layer is responsible for overall power coordination of the microgrid cluster, while the lower layer handles the autonomous optimization within each sub-microgrid. For the high-dimensional continuous decision-making problem at the upper layer, a variational autoencoder (VAE) is introduced for state compression, and the SAC algorithm is enhanced with a time-decay prioritized experience replay mechanism to improve learning efficiency and environmental adaptability. For the lower-layer optimization problem, which involves multiple constraints and nonlinearity, an improved PSO algorithm is designed, incorporating adaptive inertia weights, elite leader selection, and constraint repair mechanisms to enhance its ability to solve problems within complex feasible regions.
下载: