基于DQN的多类型拦截装备复合式反无人机任务分配方法

doi:10.13195/j.kzyjc.2020.0787

首页 > 过刊浏览>2022年第37卷第1期 >142-150. DOI:10.13195/j.kzyjc.2020.0787

基于DQN的多类型拦截装备复合式反无人机任务分配方法
DOI:
                        10.13195/j.kzyjc.2020.0787
                    
CSTR:
                        
                    
作者:
                        
                        
                    
作者单位:国防科技大学 系统工程学院,长沙 410073
作者简介:
通讯作者:E-mail: cgq299@nudt.edu.cn.
中图分类号:TP273
基金项目:国家自然科学基金项目(62073333)；装备发展部领域基金项目(61403120206).

Task assignment method of compound anti-drone based on DQN for multi type interception equipment

Author:

Affiliation:

College of Systems Engineering,National University of Defense Technology,Changsha 410073,China

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

针对当前反无人系统无法有效压制无人机的问题,使用多种拦截装备构建一种新的反无人机方法.传统多目标优化算法无法解决动态的任务分配问题,对此,提出一种基于深度Q网络(DQN)的多类型拦截装备复合式反无人机任务分配模型.DQN模块对任务分配问题进行初期决策.为了提高算法收敛速度和学习效率,该方法未采用下一时刻的状态来预测Q值,而是采用当前时刻的状态来预测Q值,消除训练过程中Q值过估计的影响.之后采用进化算法对决策结果进行优化,输出多个拦截方案.以国内某机场跑道周围区域开阔地为防护对象,构建反无人机系统的任务分配仿真环境,仿真结果验证了所提出方法的有效性.同时,将DQN与Double DQN方法相比,所提出改进DQN算法训练的智能体表现更为精确,并且算法的收敛性和所求解的表现更为优异.所提出方法为反无人机问题提供了新的思路.

Abstract:

Aiming at the problem that the current anti-drone system can not effectively suppress the drone, a new compound anti-drone method is constructed by using multiple types of intercepting equipment. The traditional multi-objective optimization algorithm cannot solve the dynamic task allocation problem, this paper proposes a task assignment model of multi-type interception equipment compound anti-drone based on a deep Q network (DQN). The DQN module makes initial decisions on task allocation issues. In order to improve the convergence speed and learning efficiency of the algorithm, this method does not use the state of the next time to predict the Q value, but uses the state of the current time to predict the Q value, while eliminating the influence of over estimation of $ Q $ value in the training process. After that, an evolutionary algorithm is used to optimize the decision-making results and output multiple interception schemes. The simulation environment of task assignment of the anti-drone system is constructed by taking the open area around a domestic airport runway as the protection object. The simulation results verify the effectiveness of this method. At the same time, compared with the DQN and Double DQN methods, the improved DQN algorithm training agent performance is more accurate, and the convergence of the algorithm and the performance of the solution are more excellent. The proposed method provides new ideas for the anti-drones problem.

参考文献

相似文献

引证文献

引用本文

黄亭飞,程光权,黄魁华,等.基于DQN的多类型拦截装备复合式反无人机任务分配方法[J].控制与决策,2022,37(1):142-150

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:
最后修改日期:
录用日期:
在线发布日期: 2021-12-10
出版日期: 2022-01-20

首页

期刊简介

编委会

作者中心

精选专辑

品牌联动

引用本文

分享

文章指标

历史

文章二维码