基于深度强化学习的资源受限条件下的DIDS任务调度优化方法

doi:10.13195/j.kzyjc.2021.0448

首页 > 过刊浏览>2022年第37卷第11期 >3052-3057. DOI:10.13195/j.kzyjc.2021.0448

基于深度强化学习的资源受限条件下的DIDS任务调度优化方法
DOI:
                        10.13195/j.kzyjc.2021.0448
                    
CSTR:
                        
                    
作者:
                        
                        
                    
作者单位:1. 西安建筑科技大学 管理学院,西安 710055;2. 西安工程大学 电子信息学院,西安 710048;3. 西安工程大学 人文学院,西安 710048;4. 陕西省社会科学院,西安 710061
作者简介:
通讯作者:E-mail: huangnan93@163.com.
中图分类号:TP393.08；C931.2
基金项目:国家自然科学基金项目(71874134)；西安市科技计划项目(21XJZZ0024)；陕西省教育厅专项项目(20JX014).

An optimization method for DIDS task scheduling under resource- constrained conditions based on deep reinforcement learning

Author:

Affiliation:

1. College of Management,Xián University of Architecture and Technology,Xián 710055,China;2. School of Electronic and Information,Xián Polytechnic University,Xián 710048,China;3. College of Humanities,Xián Polytechnic University,Xián 710048,China;4. Shaanxi Academy of Social Sciences,Xián 710061,China

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

在节点性能有限的边缘计算环境下进行分布式入侵检测系统(distributed intrusion detection system,DIDS)的任务分配,是一种典型的资源受限任务调度问题.针对该问题,提出基于深度强化学习的DIDS低负载任务调度方案.该方案将任务调度过程描述为马尔科夫决策过程(Markov decision process,MDP)并建立模型的相关空间和价值函数,找到保持DIDS低负载状态的最优策略.针对状态和动作空间过大且高维连续的问题,提出通过深度循环神经网络进行函数拟合.实验表明,所提出方案可使DIDS在网络变化中动态调节调度策略,保持系统整体的低负载,而安全指标没有明显降低.

Abstract:

The task assignment of distributed intrusion detection systems(DIDS) in the edge computing environment with limited node performance is a typical resource-constrained task scheduling problem. To solve this problem, a DIDS low-load task scheduling scheme based on deep reinforcement learning is proposed. The task scheduling process is first described as a Markov decision process and the relevant space and value function of the model are established to find the optimal strategy for maintaining the low-load state of the DIDS. To solve the problem of excessively large action space and high-dimensional continuity, a deep recurrent neural network is proposed to perform function fitting. The experimental results show that the proposed scheme enables the DIDS to dynamically adjust the scheduling strategy during network changes, keeping the overall system load low, and the safety indicators are not significantly reduced.

参考文献

相似文献

引证文献

引用本文

赵旭,黄光球,江晋,等.基于深度强化学习的资源受限条件下的DIDS任务调度优化方法[J].控制与决策,2022,37(11):3052-3057

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:
最后修改日期:
录用日期:
在线发布日期: 2022-09-30
出版日期: 2022-11-20

首页

期刊简介

编委会

作者中心

精选专辑

品牌联动

引用本文

分享

文章指标

历史

文章二维码