出租车
计算机科学
强化学习
马尔可夫决策过程
闲置
任务(项目管理)
过程(计算)
极限(数学)
部分可观测马尔可夫决策过程
马尔可夫过程
运筹学
马尔可夫链
人工智能
运输工程
机器学习
马尔可夫模型
工程类
数学分析
统计
数学
系统工程
操作系统
作者
Hao Yu,Xi Guo,Jie Chen,Xiao Luo
标识
DOI:10.1007/978-3-031-46677-9_39
摘要
Ride-hailing apps, such as Didi and Uber, allow people to easily request a ride by inputting their desired origin and destination locations. Due to transportation system complexity and vast city areas, uneven distribution of vehicle supply versus rider demand frequently occurs. This can lead to overcrowded areas with insufficient taxis or sparsely populated zones with abundant empty taxis. To balance supply and demand, many studies have proposed taxi-repositioning methods. However, recent studies limit to repositioning taxis in the one-to-one manner. In this paper, we propose the M2MTR method that can reposition idle taxis in the many-to-many manner. We define the reposition task as a partially observable Markov decision process and define the optimization objectives. To find good reposition strategies, we propose the M2MTR method that is a variation of a multi-agent cooperative A2C method. To make models converge quickly, we design the rewards delicately. To update the policy networks efficiently, we design a local reward combiner. We build an environment simulator to train and evaluate M2MTR. Extensive experiments on real datasets show that M2MTR outperforms other three baseline algorithms. The reposition strategies obtained from M2MTR can make supply and demand more balance, can increase the response rates, and can reduce response time of taxis.
科研通智能强力驱动
Strongly Powered by AbleSci AI