旅行商问题
强化学习
时间范围
计算机科学
2-选项
数学优化
常量(计算机编程)
人工智能
算法
数学
程序设计语言
作者
WU Guojin,Zizhen Zhang,Hong Liu,Jiahai Wang
标识
DOI:10.1109/smc52423.2021.9658956
摘要
Traveling Salesman Problem (TSP) is a well-known NP-hard combinatorial optimization problem. Recently, many researchers have used deep reinforcement learning to solve it. However, traffic factors are rarely considered in their works, in which the traveling time between customer locations is assumed to be constant over the planning horizon. For many practical scenarios, the traffic conditions between customer locations may change over time due to the impact of traffic patterns. Thus, this paper considers a Time-Dependent Traveling Salesman Problem with Time Windows (TDTSPTW), where the time dependency is obtained by fitting the collected traffic data into real-time traffic function with the interpolation method. We propose a deep reinforcement learning framework to solve TDTSPTW. Extensive experiments on TDTSPTW instances indicate that the proposed method can capture the real-time traffic changes and yield high-quality solutions within a very short time, compared with other typical baseline algorithms.
科研通智能强力驱动
Strongly Powered by AbleSci AI