强化学习
计算机科学
马尔可夫决策过程
北京
公共交通
启发式
交通拥挤
马尔可夫过程
对策
运筹学
运输工程
实时计算
工程类
人工智能
统计
数学
法学
政治学
中国
航空航天工程
作者
Haoyang Yan,Zhiyong Cui,Xinqiang Chen,Xiaolei Ma
出处
期刊:IEEE Transactions on Industrial Informatics
[Institute of Electrical and Electronics Engineers]
日期:2022-03-11
卷期号:19 (1): 469-479
被引量:19
标识
DOI:10.1109/tii.2022.3158651
摘要
As a primary countermeasure to mitigate traffic congestion and air pollution, promoting public transit has become a global census. Designing a robust and reliable bus timetable is a pivotal step to increase ridership and reduce operating cost for transit authorities. However, most previous studies on bus timetabling rely on historical passenger count and travel time data to generate static schedules, which often yield biased results in these uncertain scenarios, such as demand surge or adverse weather. In addition, acquiring real-time passenger origin/destination from a limited number of running buses is not feasible. This article considers the multiline dynamic bus timetable optimization problem as a Markov decision process model to address the aforementioned issues, and proposes a multiagent deep reinforcement learning framework to ensure effective learning from the imperfect-information game, where the passenger demand and traffic condition are not always known in advance. Moreover, a distributed reinforcement learning algorithm is applied to overcome the limitation of high computational cost and low efficiency. A case study of multiple bus lines in Beijing, China, confirms the effectiveness and efficiency of the proposed model. The results demonstrate that our method outperforms heuristic and state-of-the-art reinforcement learning algorithms by reducing 20.30% of operating and passenger costs compared with actual timetables.
科研通智能强力驱动
Strongly Powered by AbleSci AI