基于软演员–评论家算法的电力市场发电商报价策略
CSTR:
作者:
作者单位:

1037号;2.华中科技大学

作者简介:

通讯作者:

中图分类号:

TP3

基金项目:

国家自然科学基金项目(面上项目,重点项目,重大项目)


Bidding Strategy for Generation Companies in Electricity Markets Based on the Soft Actor–Critic Algorithm
Author:
Affiliation:

Fund Project:

The National Natural Science Foundation of China (General Program, Key Program, Major Research Plan)

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    针对电力市场中发电商非合作博弈具有动态性强、信息不完全等特点, 提出一种基于软演员–评论家算法 (Soft Actor-Critic, SAC) 的独立智能体报价学习方法.首先,在截距参数化供给函数的基础上,构建以发电商长期收益最大化为目标,并考虑直流潮流约束和节点电价形成机制的电力市场出清模型,将报价函数截距作为发电商的连续决策变量;然后,基于SAC算法构建发电商独立学习框架,通过最大熵目标增强策略探索性和收敛鲁棒性,并结合市场出清结果设计基于收益反馈的状态、动作与奖励映射机制,实现各发电商在无显式通信条件下的自适应策略更新;最后,基于IEEE 3节点和30节点系统开展数值仿真.仿真结果表明,所提出的基于SAC的独立智能体方法能够有效逼近纳什均衡报价策略,具有良好的收敛特性, 并能揭示在高折扣因子条件下电力市场中可能出现的默契合谋行为.

    Abstract:

    In view of the dynamic and partially observable characteristics of non-cooperative bidding games among generation companies in electricity markets, this paper proposes a bidding strategy approach based on the Soft Actor–Critic (SAC) algorithm. First, on the basis of an intercept-parameterized supply function, an electricity market clearing model is established with the objective of maximizing the long-term profit of GenCos, while considering DC power flow constraints and the formation mechanism of locational marginal prices. The intercept of the supply function is defined as the continuous decision variable of each GenCo. Then, to address the limitations of the traditional Deep Deterministic Policy Gradient (DDPG) algorithm in terms of training stability and exploration capability, the SAC algorithm is employed to construct an independent learning framework for GenCos. The use of a maximum-entropy objective enhances policy exploration and convergence robustness. A state–action–reward mapping mechanism based on profit feedback is designed according to the market clearing results, enabling each GenCo to adaptively update its strategy without explicit communication. Finally, numerical simulations are conducted on the IEEE 3-bus and 30-bus systems. The results demonstrate that the proposed SAC-based independent agent approach can effectively approximate Nash equilibrium bidding strategies, exhibits superior convergence characteristics, and reveals potential tacit collusion behavior in electricity markets under high discount factors.

    参考文献
    相似文献
    引证文献
引用本文
相关视频

分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2025-11-14
  • 最后修改日期:2026-02-26
  • 录用日期:2026-02-27
  • 在线发布日期: 2026-03-09
  • 出版日期:
文章二维码