基于多目标深度强化学习的需求响应式列车时刻表优化

doi:10.13195/j.kzyjc.2025.1334

首页 > 过刊浏览>年第0卷第期 >. DOI:10.13195/j.kzyjc.2025.1334

基于多目标深度强化学习的需求响应式列车时刻表优化
DOI:
                        10.13195/j.kzyjc.2025.1334
                    
CSTR:
                        
                    
作者:
                        
                        
                    
作者单位:兰州交通大学交通运输学院
作者简介:
通讯作者:
中图分类号:U292.4
基金项目:国家自然科学基金项目

Optimization of demand-responsive train timetabling based on multi-objective deep reinforcement learning

Author:

Affiliation:

Fund Project:

The National Natural Science Foundation of China

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

客流在时空维度上呈现的时变特性对城市轨道交通运营管理提出了严峻挑战,同时,需求响应式列车时刻表优化问题的复杂性对于算法设计提出了更高的要求. 为此,本文将人工智能领域的深度强化学习方法应用到地铁列车时刻表的优化问题中,以提升地铁运营管理的智能化水平. 将时变客流需求与列车时刻表决策互动关系构建为马尔可夫决策过程,为智能体提供训练和学习环境. 其中,以列车到发、乘客状态以及列车数量作为状态空间,以列车发车间隔作为动作空间,并设计了“人-车-站”一体化的多维复合奖励函数. 开发了一种基于自适应发车间隔和列车数量的多目标软演员-评论家优化算法提升求解效率. 以小规模算例进行超参数优化,并验证了需求响应式列车时刻表相对于均衡列车时刻表的优势. 以广州市地铁8号线进行仿真实验,结果表明,所提出的方法相对于其他人工智能方法及启发式算法具有较快的收敛速度和求解效率. 此外,针对不同客流扰动场景,方法能够在短时间内生成满意的运营方案,说明方法具有较好的泛化能力. 研究结果可为进一步提升地铁运营调度智能化水平提供理论和方法支撑.

Abstract:

The time-varying characteristics of passenger flow in spatial and temporal dimensions impose significant challenges on urban rail transit operation management, while the complexity of demand-responsive train timetabling calls for more advanced algorithmic solutions. To address this, this paper introduces a deep reinforcement learning (DRL) approach to optimize subway train schedules, thereby improving the intelligence of operational management. The dynamic interaction between time-varying passenger demand and train timetable is formulated as a Markov decision process, which serves as a training environment for the learning agent. The state space includes train arrivals/departures, passenger states, and train counts, the action space is defined by dispatch intervals. A multi-dimensional reward function jointly considers passengers, trains, and stations. A multi-objective Soft Actor-Critic algorithm with adaptive dispatch intervals and train numbers is developed for efficient optimization. Hyper-parameter tuning via a small-scale case and confirms that demand-responsive timetable outperforms fixed-interval timetable. Simulations on Guangzhou Metro Line 8 show faster convergence and higher solution efficiency compared to other AI and heuristic methods. Moreover, under various passenger flow disturbance scenarios, the method can generate satisfactory operation plans in a short period of time, demonstrating its strong generalization capability. These results provide theoretical and methodological support for intelligent subway scheduling.

参考文献

相似文献

引证文献

引用本文

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2025-12-25
最后修改日期:2026-04-03
录用日期:2026-04-05
在线发布日期: 2026-04-14
出版日期:

首页

期刊简介

编委会

作者中心

精选专辑

品牌联动

引用本文

相关视频

分享

文章指标

历史

文章二维码