基于强化学习的无线自组网络多节点干扰策略
CSTR:
作者:
作者单位:

(1. 国防科技大学电子对抗学院,合肥230037;2. 安徽省电子制约技术重点实验室,合肥230037)

作者简介:

颛孙少帅(1990-), 博士生, 从事认知干扰和强化学习的研究;杨俊安(1965-), 教授, 从事信号处理和智能计算等研究.

通讯作者:

E-mail: zhuansunss@sina.com

中图分类号:

TP972

基金项目:

安徽省自然科学基金项目(1308085QF99, 1408085MKL46).


Multi-nodes jamming strategy in wireless Ad hoc network based on reinforcement learning
Author:
Affiliation:

(1. College of Electronic Countermeasure,National University of Defense Technology,Hefei230037,China;2. Key Laboratory of Electronic Restriction,Hefei230037,China)

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    为了实现无线自组网络通信拒止的干扰需求,构建无线自组网络模型,并针对该模型提出一种基于强化学习理论的未知拓扑网络多节点干扰策略选择算法,以实时交互的方式进行在线学习.该算法无需获悉网络拓扑等先验知识,仅以网络流数目作为反馈信息,以多节点联合干扰的方式逐步学习最佳干扰节点.在不同参数的无线自组网中的仿真结果表明,所提算法在累积阻断网络流方面优于现有算法,且在新的奖赏标准下,所提算法仍具有优异的干扰性能.

    Abstract:

    An unknown topology network interdiction strategy based on reinforcement learning is proposed. When the model of the wireless Ad hoc network is established, the proposed interdiction strategy could fulfill the needs of interdicting information transmission by jamming multi nodes and enable the jammer to interdict communication in the underlying network in real time manner. By jamming multi nodes as an operation style and counting stopped network flows as action feedback, the proposed strategy could learn the better nodes to jam without a priori knowledge of the network topology. Simulation results on the established wireless Ad hoc network with various parameters show that, the proposed interdiction strategy has a better performance in accumulate stopped flows than existing algorithms, and still has excellent jamming capability under the proposed new reward standard.

    参考文献
    相似文献
    引证文献
引用本文

颛孙少帅,杨俊安,刘辉,等.基于强化学习的无线自组网络多节点干扰策略[J].控制与决策,2018,33(7):1199-1206

复制
相关视频

分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2018-07-03
  • 出版日期:
文章二维码