基于近端策略优化的动态武器目标分配
CSTR:
作者:
作者单位:

作者简介:

通讯作者:

中图分类号:

E91;TP18

基金项目:

北京市自然科学基金面上项目(4252050);国家自然科学基金青年科学基金项目(A类)(62425304);国家自然科学基金基础科学中心项目(62088101).


Dynamic weapon-target assignment based on proximal policy optimization
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    现代战场环境下的动态传感器-武器-目标分配(SWTA)问题具有高动态、强对抗的特点, 传统静态分配方法难以适应战场态势的快速演化, 存在求解效率低、环境适应性差等局限. 鉴于此, 提出一种基于近端策略优化(PPO)的动态SWTA方法, 融合OODA(观察-判断-决策-行动)循环理论, 构建符合实际作战场景的传感器探测概率模型与武器毁伤概率模型, 通过PPO算法实现智能体与环境的持续交互与策略优化, 在决策过程中统筹作战效能与资源消耗. 实验结果表明, 该方法在多种弹药目标比场景下均表现出优越性能, 能够显著提升系统整体作战的效能与资源利用率. 所提出方法为动态SWTA问题提供了一种高效、自适应的智能决策框架, 推动了指挥决策的智能化进程, 具备较强的实际应用潜力.

    Abstract:

    The dynamic sensor-weapon-target assignment (SWTA) problem in modern battlefield environments is characterized by high dynamism and strong adversariality. Traditional static assignment methods struggle to adapt to the rapidly evolving battlefield situation due to their low solving efficiency and inadequate environmental adaptability. To address these challenges, this paper proposes a dynamic SWTA method based on proximal policy optimization (PPO). By integrating the OODA (observe–orient–decide–act) loop theory, the method constructs realistic sensor detection probability and weapon kill probability models. Through continuous interaction between the agent and the environment, the PPO algorithm optimizes the strategy while balancing operational effectiveness and resource consumption. Experimental results demonstrate that the proposed method achieves superior performance across various ammo-to-target ratio scenarios, significantly enhancing both resource utilization and overall operational effectiveness. This study provides an efficient and adaptive intelligent decision-making framework for the dynamic SWTA problem, advancing the process of command and decision-making, with strong potential for practical application.

    参考文献
    相似文献
    引证文献
引用本文

王晴,王浩然,辛斌,等.基于近端策略优化的动态武器目标分配[J].控制与决策,2026,41(4):919-930

复制
相关视频

分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2025-09-01
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2026-03-24
  • 出版日期: 2026-04-10
文章二维码