强化学习
计算机科学
匹配(统计)
马尔可夫决策过程
服务(商务)
人气
订单(交换)
过程(计算)
马尔可夫过程
闲置
人工智能
运筹学
工程类
社会心理学
统计
数学
操作系统
经济
经济
心理学
财务
作者
Mingyue Xu,Peng Yue,Fan Yu,Can Yang,Mingda Zhang,Shangcheng Li,Hao Li
标识
DOI:10.1080/13658816.2022.2119477
摘要
The popularity of ride-hailing platforms has significantly improved travel efficiency by providing convenient and personalized transportation services. Designing an effective ride-hailing service generally needs to address two tasks: order matching that assigns orders to available vehicles and proactive vehicle repositioning that deploys idle vehicles to potentially high-demand regions. Recent studies have intensively utilized deep reinforcement learning to solve the two tasks by learning an optimal dispatching strategy. However, most of them generate actions for the two tasks independently, neglecting the interactions between the two tasks and the communications among multiple drivers. To this end, this paper provides an approach based on multi-agent deep reinforcement learning where the two tasks are modeled as a unified Markov decision process, and the colossal state space and competition among drivers are addressed. Additionally, a modifiable agent-specific state representation is proposed to facilitate knowledge transferring and improve computing efficiency. We evaluate our approach on a public taxi order dataset collected in Chengdu, China, where a variable number of simulated vehicles are tested. Experimental results show that our approach outperforms seven existing baselines, reducing passenger rejection rate, driver idle time and improving total driver income.
科研通智能强力驱动
Strongly Powered by AbleSci AI