面向有限通信约束下多智能体对抗的团队博弈决策研究
CSTR:
作者:
作者单位:

1.西北工业大学;2.西北机电工程研究所

作者简介:

通讯作者:

中图分类号:

TP18

基金项目:

国家自然科学基金项目(面上项目,重点项目,重大项目)


Research on Team Game Decision-Making for Multi-agent Confrontation under Limited Communication Constraints
Author:
Affiliation:

Fund Project:

The National Natural Science Foundation of China (General Program, Key Program, Major Research Plan)

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    随着无人系统平台的快速发展,无人集群对抗是未来体系能力发展与建设的关键.然而,在应对集群目标时,集群间的对抗博弈加剧了有限通信约束下通信带宽压力、链路动态变化和信息时延积累等问题,集群对抗决策面临态势信息过载、协同能力下降与团队响应滞后的挑战. 针对上述问题,首先构建了多智能体对抗的运动学模型与对抗损耗模型,并在此基础上提出了有限通信约束下多智能体对抗的团队博弈建模与求解方法.通过设计小世界网络通信的团队联盟策略,将有限通信约束引入多智能体对抗的团队博弈模型,给出了基于关联团队最大最小均衡解概念的团队博弈均衡定义与存在性条件.然后,基于异构智能体强化学习算法设计了交替冻结训练的团队博弈均衡求解算法.最后,仿真结果表明,所提出的博弈模型与交替冻结训练框架能够获得稳定的策略剖面,实现了有限通信条件下的集群自适应动态决策与对抗策略优化.

    Abstract:

    With the rapid development of unmanned system platforms, unmanned swarm confrontation has become crucial for the future development and construction of system capabilities. However, when engaging swarm targets, adversarial interactions between swarms exacerbate issues such as communication bandwidth pressure, dynamic link variations, and accumulated information delays under limited communication constraints. Swarm confrontation decision-making faces challenges including overloaded situational information, degraded collaborative capabilities, and delayed team responses. To address these problems, this study first establishes kinematic models and attrition models for multi-agent confrontation, and subsequently proposes a team game modeling and solution approach for multi-agent confrontation under limited communication constraints. By designing a team alliance strategy based on small-world network communication, limited communication conditions are incorporated into the team game model for multi-agent confrontation. The definition and existence conditions of team game equilibrium are provided, based on the concept of correlated team max-min equilibrium solutions. Then, a heterogeneous multi-agent reinforcement learning algorithm is developed to design an alternating freeze-training method for solving the team game equilibrium. Finally, the simulation results demonstrate that the proposed game model and alternating freezing training framework can obtain stable strategy profiles, achieving adaptive dynamic decision-making and adversarial strategy optimization under limited communication conditions.

    参考文献
    相似文献
    引证文献
引用本文
相关视频

分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2025-11-17
  • 最后修改日期:2026-03-24
  • 录用日期:2026-03-25
  • 在线发布日期: 2026-04-17
  • 出版日期:
文章二维码