强化学习
解耦(概率)
匹配(统计)
计算机科学
订单(交换)
食物运送
人工智能
按需
数学优化
机器学习
数学
工程类
业务
营销
控制工程
统计
多媒体
财务
作者
Jingfang Chen,Ling Wang,Zixiao Pan,Yuting Wu,Jie Zheng,Xuetao Ding
出处
期刊:Tsinghua Science & Technology
[Tsinghua University Press]
日期:2023-09-21
卷期号:29 (2): 386-399
被引量:5
标识
DOI:10.26599/tst.2023.9010069
摘要
The on-demand food delivery (OFD) service has gained rapid development in the past decades but meanwhile encounters challenges for further improving operation quality. The order dispatching problem is one of the most concerning issues for the OFD platforms, which refer to dynamically dispatching a large number of orders to riders reasonably in very limited decision time. To solve such a challenging combinatorial optimization problem, an effective matching algorithm is proposed by fusing the reinforcement learning technique and the optimization method. First, to deal with the large-scale complexity, a decoupling method is designed by reducing the matching space between new orders and riders. Second, to overcome the high dynamism and satisfy the stringent requirements on decision time, a reinforcement learning based dispatching heuristic is presented. To be specific, a sequence-to-sequence neural network is constructed based on the problem characteristic to generate an order priority sequence. Besides, a training approach is specially designed to improve learning performance. Furthermore, a greedy heuristic is employed to effectively dispatch new orders according to the order priority sequence. On real-world datasets, numerical experiments are conducted to validate the effectiveness of the proposed algorithm. Statistical results show that the proposed algorithm can effectively solve the problem by improving delivery efficiency and maintaining customer satisfaction.
科研通智能强力驱动
Strongly Powered by AbleSci AI