基于可配置CFR的海上基地防护安全博弈策略求解

doi:10.13195/j.kzyjc.2024.1172

首页 > 过刊浏览>2025年第40卷第8期 >2503-2512. DOI:10.13195/j.kzyjc.2024.1172

基于可配置CFR的海上基地防护安全博弈策略求解
DOI:
                        10.13195/j.kzyjc.2024.1172
                    
CSTR:
                        
                    
作者:
                        
                        
                    
作者单位:
作者简介:
通讯作者:
中图分类号:TP273
基金项目:国家自然科学基金项目(61702528).

Configurable CFR for strategy solving of security game in maritime base protectioon

Author:

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

围绕海上基地的攻防可看作一个多阶段序贯对抗过程, 通常可建模为不完美信息零和博弈. 针对海上基地防护安全博弈问题, 构建不完美信息序贯博弈模型, 分析博弈模型各要素; 围绕近似纳什均衡策略的快速求解, 提出可配置反事实遗憾最小化(CogCFR)算法, 利用基类CFR算法与元控制器可动态控制CFR的超参数; 以海上多个海上基地防护为试验背景, 利用CogCFR求解海上基地防护资源分配策略. 针对有限理性对手, 提出考虑约束的单侧信任域鲁棒对手利用策略更新方式. 实验结果表明: 可配置反事实遗憾最小化相比动态加权反事实遗憾最小化计算时效性更强、参数更少; 算法具有较好的应用可行性和领域泛化性, 可为序贯交互类博弈对抗问题策略求解提供参考.

Abstract:

The offensive and counterattack around the maritime island can be regarded as a multi-stage sequential counterattack process, which is usually modeled as a zero-sum game with imperfect information, the game model elements are analysized. Aiming at the security game problem of maritime base protection, the imperfect information sequential game model is constructed. A configurable counterfactual regret (CogCFR) minimization algorithm based on base CFR variants and meta-controller is proposed to solve the approximate Nash equilibrium strategy quickly, which can dynamically control the CFR hyperparameters. For the experimental background of multiple maritime island protection, the CogCFR minimization algorithm is used to solve the resource allocation strategy of maritime island protection. This paper presents a robust opponent exploitation strategy updating method with unilateral trust region considering constraints for bounded rational opponent. The experimental results show that the CogCFR minimization with meta-learning is more efficient and has fewer parameters than the dynamic weighted CFR minimization. The algorithm has good application feasibility and domain generalization, and can provide reference for strategy solving of sequential interactive game.

参考文献

相似文献

引证文献

引用本文

罗俊仁,张万鹏,谷学强,等.基于可配置CFR的海上基地防护安全博弈策略求解[J].控制与决策,2025,40(8):2503-2512

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2024-10-04
最后修改日期:
录用日期:
在线发布日期: 2025-07-11
出版日期: 2025-08-20

首页

期刊简介

编委会

作者中心

精选专辑

品牌联动

引用本文

相关视频

分享

文章指标

历史

文章二维码