车辆路径问题
计算机科学
水准点(测量)
强化学习
背景(考古学)
一般化
时间复杂性
数学优化
GSM演进的增强数据速率
残余物
人工神经网络
图形
算法
人工智能
布线(电子设计自动化)
数学
理论计算机科学
计算机网络
古生物学
数学分析
大地测量学
生物
地理
作者
Kun Lei,Peng Guo,Yi Wang,Xiao Qin Wu,Wenchao Zhao
标识
DOI:10.1016/j.neucom.2022.08.005
摘要
For NP-hard combinatorial optimization problems, it is usually challenging to find high-quality solutions in polynomial time. Designing either an exact algorithm or an approximate algorithm for these problems often requires significantly specialized knowledge. Recently, deep learning methods have provided new directions to solve such problems. In this paper, an end-to-end deep reinforcement learning framework is proposed to solve this type of combinatorial optimization problems. This framework can be applied to different problems with only slight changes of input, masks, and decoder context vectors. The proposed framework aims to improve the models in literacy in terms of the neural network model and the training algorithm. The solution quality of TSP and the CVRP up to 100 nodes are significantly improved via our framework. Compared with the best results of the state-of-the-art methods, the average optimality gap is reduced from 4.53% to 3.67% for TSP with 100 nodes and from 7.34% to 6.68% for CVRP with 100 nodes when using the greedy decoding strategy. Besides, the proposed framework can be used to solve a multi-depot CVRP case without any structural modification. Furthermore, our framework uses about 1/3∼3/4 training samples compared with other existing learning methods while achieving better results. The results performed on randomly generated instances, and the benchmark instances from TSPLIB and CVRPLIB confirm that our framework has a linear running time on the problem size (number of nodes) during training and testing phases and has a good generalization performance from random instance training to real-world instance testing.
科研通智能强力驱动
Strongly Powered by AbleSci AI