Fuzzy job shop scheduling problem based on deep reinforcement learning
Affiliation:
College of Electrical Engineering, Xinjiang University, Urumqi 830047, China
Abstract:
For the job shop scheduling problem with fuzzy processing times and fuzzy due dates, this paper takes the proximal policy optimization (PPO) algorithm as the basic optimization framework, with the objective of minimizing the maximum completion time, and proposes an LSTM-PPO (proximal policy optimization with long short-term memory) algorithm to solve it. First, a new set of state features is designed to model the scheduling problem, and job operations are selected directly from these state features, which matches the decision process of scheduling in real environments more closely. Second, a long short-term memory (LSTM) network is incorporated into the actor-critic framework of the PPO algorithm. This addresses the difficulty conventional models have in scaling when the problem size changes, enabling the agent to obtain a final scheduling solution even when the numbers of jobs, operations, and machines vary. Experiments on selected fuzzy job shop scheduling benchmark sets verify that the proposed algorithm achieves better performance.
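Two ingredients the abstract relies on can be sketched concretely: the triangular-fuzzy-number (TFN) arithmetic commonly used to accumulate fuzzy completion times in fuzzy job shop scheduling, and the clipped surrogate objective at the core of PPO. The sketch below is illustrative only, not the paper's implementation; the function names and the (a1 + 2*a2 + a3)/4 ranking criterion are common conventions in the fuzzy-scheduling literature assumed here, not details taken from this paper.

```python
import numpy as np

def tfn_add(x, y):
    """Fuzzy addition: component-wise sum of two triangular fuzzy numbers."""
    return tuple(a + b for a, b in zip(x, y))

def tfn_rank(x):
    """A common defuzzification score for a TFN (a1, a2, a3)."""
    a1, a2, a3 = x
    return (a1 + 2 * a2 + a3) / 4

def tfn_max(x, y):
    """Approximate fuzzy max: keep the TFN with the larger ranking score,
    as used when a machine's release time meets a job's ready time."""
    return x if tfn_rank(x) >= tfn_rank(y) else y

def ppo_clip_loss(ratio, advantage, eps=0.2):
    """PPO clipped surrogate: minimum of the unclipped and clipped
    probability-ratio terms, negated for gradient descent."""
    unclipped = ratio * advantage
    clipped = np.clip(ratio, 1 - eps, 1 + eps) * advantage
    return -np.minimum(unclipped, clipped).mean()
```

In a full LSTM-PPO agent, `ratio` would come from the LSTM actor's new and old policy probabilities for the selected operation, and `advantage` from the critic's value estimates over the fuzzy makespan reward.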