旅行商问题
计算机科学
皮卡
强化学习
数学优化
旅行购买者问题
2-选项
常量(计算机编程)
瓶颈旅行商问题
人工智能
服务(商务)
运筹学
数学
算法
经济
图像(数学)
经济
程序设计语言
作者
Zizhen Zhang,Hong Liu,MengChu Zhou,Jiahai Wang
出处
期刊:IEEE transactions on neural networks and learning systems
[Institute of Electrical and Electronics Engineers]
日期:2023-04-01
卷期号:34 (4): 2119-2132
被引量:75
标识
DOI:10.1109/tnnls.2021.3105905
摘要
A traveling salesman problem (TSP) is a well-known NP-complete problem. Traditional TSP presumes that the locations of customers and the traveling time among customers are fixed and constant. In real-life cases, however, the traffic conditions and customer requests may change over time. To find the most economic route, the decisions can be made constantly upon the time-point when the salesman completes his service of each customer. This brings in a dynamic version of the traveling salesman problem (DTSP), which takes into account the information of real-time traffic and customer requests. DTSP can be extended to a dynamic pickup and delivery problem (DPDP). In this article, we ameliorate the attention model to make it possible to perceive environmental changes. A deep reinforcement learning algorithm is proposed to solve DTSP and DPDP instances with a size of up to 40 customers in 100 locations. Experiments show that our method can capture the dynamic changes and produce a highly satisfactory solution within a very short time. Compared with other baseline approaches, more than 5% improvements can be observed in many cases.
科研通智能强力驱动
Strongly Powered by AbleSci AI