准时
弹道
计算机科学
过程(计算)
火车
一般化
集合(抽象数据类型)
轨迹优化
控制理论(社会学)
直线(几何图形)
模拟
数学优化
工程类
人工智能
数学
天文
数学分析
物理
操作系统
地图学
运输工程
程序设计语言
地理
控制(管理)
几何学
作者
Lingbin Ning,Min Zhou,Zhuopu Hou,Rob M.P. Goverde,Fei–Yue Wang,Hairong Dong
标识
DOI:10.1109/tits.2021.3105380
摘要
This paper proposes a novel train trajectory optimization approach for high-speed railways. We restrict our attention to single train operation scenarios with different scheduled/rescheduled running times aiming at generating optimal train recommended trajectories in real time, which can ensure punctuality and energy efficiency of train operation. A learning-based approach deep deterministic policy gradient (DDPG) is designed to generate optimal train trajectories based on the offline training from the interaction between the agent and the trajectory simulation environment. An allocating running time and selecting operation modes (ARTSOM) algorithm is proposed to improve train punctuality and give a series of discrete operation modes (full traction, cruising, coasting, full braking), and thus to produce a feasible training set for DDPG, which can speed up the training process. Numerical experiments show that an optimized speed profile can be generated by DDPG within seconds on a realistic railway line. In addition, the results demonstrate the generalization ability of trained DDPG in solving TTO problems with different running times and line conditions.
科研通智能强力驱动
Strongly Powered by AbleSci AI