Abstract: A two-stage multi-agent deep reinforcement learning method is proposed for the distributed heterogeneous flexible job-shop scheduling problem arising in rail vehicle assembly, where multi-model mixed-flow production, complex processes, large variations in process routes, and highly heterogeneous manufacturing resources make scheduling difficult. The scheduling process is modeled as a multi-stage Markov decision process whose decisions cover job allocation, operation sequencing, and machine selection, with the reward designed to guide the agents toward minimizing the global makespan. The upper-level agent uses a hierarchical heterogeneous graph attention network to extract the global state of the production line, achieving reasonable job allocation and load balancing across assembly lines and work zones. The lower-level agents employ a dual-agent collaboration strategy and a graph-neural-network-based encoder-decoder to capture dependencies such as precedence constraints between operations and resource occupancy, enabling local optimization. Experiments on data from actual operational scenarios show that the proposed method shortens the manufacturing cycle and exhibits good generalization.
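To make the two-stage decision structure described above concrete, the following is a minimal, illustrative sketch of the hierarchical scheduling loop: an upper-level policy allocates each job to a line, and a lower-level policy sequences its operations and selects machines, with the makespan serving as the (negated) reward signal. All names here are hypothetical, and the paper's learned policies (the hierarchical heterogeneous graph attention network and the GNN encoder-decoder) are replaced by trivial greedy rules purely to keep the example self-contained and runnable.

```python
def upper_agent_assign(job_id, line_loads):
    # Upper level (stand-in for the hierarchical heterogeneous GAT policy):
    # allocate the job to the least-loaded line, i.e., load balancing.
    return min(line_loads, key=line_loads.get)

def lower_agent_schedule(job_ops, machine_free):
    # Lower level (stand-in for the GNN encoder-decoder dual agents):
    # process operations in precedence order; for each, pick the machine
    # that becomes free earliest. Returns the job's completion time.
    finish = 0.0
    for duration in job_ops:               # operations are precedence-ordered
        m = min(machine_free, key=machine_free.get)
        start = max(machine_free[m], finish)
        finish = start + duration
        machine_free[m] = finish
    return finish

def schedule(jobs, lines):
    # jobs:  {job_id: [operation durations in precedence order]}
    # lines: {line_id: {machine_id: time the machine becomes free}}
    line_loads = {line: 0.0 for line in lines}
    makespan = 0.0
    for job_id, ops in jobs.items():
        line = upper_agent_assign(job_id, line_loads)          # stage 1
        finish = lower_agent_schedule(ops, lines[line])        # stage 2
        line_loads[line] = max(line_loads[line], finish)
        makespan = max(makespan, finish)
    return makespan  # an RL formulation would use -makespan as reward

jobs = {"J1": [3.0, 2.0], "J2": [4.0], "J3": [1.0, 1.0, 2.0]}
lines = {"A": {"M1": 0.0, "M2": 0.0}, "B": {"M3": 0.0}}
print(schedule(jobs, lines))  # → 8.0
```

The split mirrors the paper's hierarchy: the upper policy only sees line-level aggregate state, while each lower policy optimizes locally within its line, which is what allows the method to scale across heterogeneous lines and work zones.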