基于Transformer-DRL的机坪特种车群调度策略研究

doi:10.13195/j.kzyjc.2024.0918

首页 > 过刊浏览>2025年第40卷第6期 >1939-1949. DOI:10.13195/j.kzyjc.2024.0918

基于Transformer-DRL的机坪特种车群调度策略研究
DOI:
                        10.13195/j.kzyjc.2024.0918
                    
CSTR:
                        
                    
作者:
                        
                        
                    
作者单位:
作者简介:
通讯作者:
中图分类号:V351+.3；TP181
基金项目:天津市教委自然科学科研基金项目(2018KJ237).

Research on scheduling strategy of special vehicle cluster on apron based on Transformer-DRL

Author:

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

针对机坪环境下多种类地面服务车辆的协同调度这一复杂的优化任务, 提出一种结合Transformer架构的深度强化学习算法. 首先, 依据航班地面服务流程的不同优先级, 将整个地面服务任务进行分解, 进而将原本复杂的多类型车辆调度问题转化为有先后顺序的单类型车辆调度问题; 接着, 利用Transformer架构对航班和车辆的特征进行自动提取, 通过解码器按序列逐步求解任务调度, 结合贪婪算法和蒙特卡洛模拟算法分别生成初步调度策略, 并将这些策略应用于每个子问题的求解过程中; 在此基础上, 利用深度强化学习算法对整个模型进行训练, 通过智能体与环境的交互来不断优化调度策略; 此外, 为了提升模型的鲁棒性和应对复杂情况的能力, 通过扩充真实数据集进行模型训练. 大量的实验结果证明, 基于Transformer架构的深度强化学习方法能够有效避免不同种类车辆之间的相互干扰, 并很好地应对真实环境下的航班调度需求.

Abstract:

Aiming at the complex optimization task of collaborative scheduling of multiple types of ground service vehicles in the ramp environment, this paper proposes a deep reinforcement learning algorithm integrated with the Transformer architecture. First, the entire ground service task is decomposed based on the varying priorities of the flight ground service process, transforming the complex multi-type vehicle scheduling problem into a sequential single-type vehicle scheduling problem. The Transformer architecture is then employed to automatically extract the features of flights and vehicles, and task scheduling is solved step by step through the decoder. Preliminary scheduling strategies are generated by combining greedy and Monte Carlo simulation algorithms, which are applied to each sub-problem. On this basis, a deep reinforcement learning algorithm is used to train the entire model, continuously optimizing the scheduling strategy through the interaction between the agent and the environment. Further, to enhance the model’s robustness and ability to handle complex situations, the model is trained by expanding the real dataset. Extensive experiments demonstrate that the deep reinforcement learning approach based on the Transformer architecture effectively prevents mutual interference among different vehicle types and can meet the flight scheduling requirements in real-world environments.

参考文献

相似文献

引证文献

引用本文

陈维兴,李晨辉,李业波.基于Transformer-DRL的机坪特种车群调度策略研究[J].控制与决策,2025,40(6):1939-1949

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2024-07-30
最后修改日期:
录用日期:
在线发布日期: 2025-04-30
出版日期: 2025-06-20

首页

期刊简介

编委会

作者中心

精选专辑

品牌联动

引用本文

相关视频

分享

文章指标

历史

文章二维码