强化学习
计算机科学
过境(卫星)
直线(几何图形)
控制(管理)
偏爱
服务(商务)
公共交通
国家(计算机科学)
实时计算
运输工程
分布式计算
工程类
人工智能
数学
算法
统计
几何学
经济
经济
标识
DOI:10.1016/j.trc.2023.104309
摘要
Bus bunching is a long-standing problem in transit operation and ruining the regularity of transit service. In a typical urban transit network setting of multiple lines with a shared corridor, bus bunching becomes more frequent as there is more uncertainty inside the shared corridor. While multi-agent reinforcement learning (MARL) has been a promising scheme for learning efficient control policy in a multi-agent system, few studies have explored its applicability in multi-line transit control scenarios. In this study, we focus on a basic transit network where there are two bus lines with a shared corridor. An efficient MARL framework is proposed to learn multi-line bus holding control to avoid bus bunching. Specifically, we design observation and reward functions that incorporate multi-line information. In addition, a preference weights producer is introduced to update the objective weights towards a good trajectory evaluation during daily transit operation. In this way, we handle the multi-objective issue in multi-line control. In experimental studies, we validate the superiority of the method in real-world bus lines. Results show that the state and reward augmented with multi-line information benefit MARL in multi-line bus control. Besides, by updating preference weights towards less passenger waiting time, the regularity of transit service is further improved.
科研通智能强力驱动
Strongly Powered by AbleSci AI