计算机科学
强化学习
过程(计算)
马尔可夫过程
实时计算
增强学习
火车
北京
调度(生产过程)
运筹学
地铁列车时刻表
任务(项目管理)
数学优化
马尔可夫决策过程
钢筋
人工智能
工程类
运营管理
操作系统
统计
中国
法学
系统工程
地理
地图学
数学
政治学
作者
Yin Wang,Yisheng Lv,Jianying Zhou,Zhiming Yuan,Qi Zhang,Min Zhou
出处
期刊:International Conference on Intelligent Transportation Systems
日期:2021-09-19
被引量:1
标识
DOI:10.1109/itsc48978.2021.9564980
摘要
In the daily management of high-speed railway systems, the train timetable rescheduling problem with unpredictable disturbances is a challenging task. The large number of stations and trains leads to a long-time consumption to solve the rescheduling problem, making it difficult to meet the realtime requirements in real-world railway networks. This paper proposes a policy-based reinforcement learning approach to address the high-speed railway timetable rescheduling problem, in which the agent minimizes the total delay by adjusting the departure sequence of all trains along the railway line. A two-stage Markov Decision Process model is established to model the environment where states, actions, and reward functions are designed. The proposed method contains an offline learning process and an online application process, which can give the optimal rescheduling schedule based on the current state immediately. Numerical experiments are performed over two different delay scenarios on the Beijing-Shanghai high-speed railway line. The simulation results show that our approach can find a high-quality rescheduling strategy within one second, which is superior to the First-Come-First-Served (FCFS) and First-Scheduled-First-Served (FSFS) methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI