Traveling salesman problem
Solver
Reinforcement learning
Computer science
Reinforcement
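Problem solver
Mathematical optimization
Artificial intelligence
Mathematics
Psychology
Algorithm
Computational science
Social psychology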
Authors
Yubin Xiao, Di Wang, Boyang Li, Huanhuan Chen, Wei Pang, Xuan Wu, Hao Li, Dong Xu, Yanchun Liang, You Zhou
Identifier
DOI:10.1109/tnnls.2024.3483231
Abstract
The traveling salesman problem (TSP) is a well-known combinatorial optimization problem (COP) with broad real-world applications. Recently, neural networks (NNs) have gained popularity in this research area because, as shown in the literature, they provide strong heuristic solutions to TSPs. Compared with autoregressive neural approaches, nonautoregressive (NAR) networks exploit inference parallelism to increase inference speed but suffer from comparatively low solution quality. In this article, we propose a novel NAR model named NAR4TSP, which incorporates a specially designed architecture and an enhanced reinforcement learning (RL) strategy. To the best of our knowledge, NAR4TSP is the first TSP solver that successfully combines RL and NAR networks. The key lies in incorporating the decoding of the NAR network output into the training process. NAR4TSP efficiently represents TSP-encoded information as rewards and seamlessly integrates it into RL strategies, while maintaining consistent TSP sequence constraints during both the training and testing phases. Experimental results on both synthetic and real-world TSPs demonstrate that NAR4TSP outperforms five state-of-the-art (SOTA) models in terms of solution quality, inference speed, and generalization to unseen scenarios.
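As a rough illustration of what decoding an NAR output into a valid tour and turning it into a reward can look like, the sketch below greedily decodes a tour from an edge-score matrix while masking visited cities, then uses the negative tour length as a reward. This is not the NAR4TSP method: the abstract does not specify the decoding rule or the reward, so the greedy rule, the names `decode_tour` and `tour_length`, and the hand-made `heatmap` (a stand-in for network output) are all assumptions made for illustration.

```python
import math


def decode_tour(heatmap, start=0):
    """Greedily decode a tour from an n x n edge-score matrix.

    heatmap[i][j] is a hypothetical score for travelling from city i to
    city j, of the kind an NAR network might emit in one forward pass.
    Visited cities are masked so every city appears exactly once.
    """
    n = len(heatmap)
    tour = [start]
    visited = {start}
    while len(tour) < n:
        current = tour[-1]
        # Choose the highest-scoring city that has not been visited yet.
        nxt = max(
            (j for j in range(n) if j not in visited),
            key=lambda j: heatmap[current][j],
        )
        tour.append(nxt)
        visited.add(nxt)
    return tour


def tour_length(coords, tour):
    """Length of the closed tour, including the return to the start city."""
    n = len(tour)
    return sum(
        math.dist(coords[tour[i]], coords[tour[(i + 1) % n]])
        for i in range(n)
    )


# Four cities on the corners of a unit square.
coords = [(0.0, 0.0), (0.0, 1.0), (1.0, 1.0), (1.0, 0.0)]
# Stand-in for network output (assumption): score each edge by its
# negative Euclidean distance, so shorter edges score higher.
heatmap = [[-math.dist(a, b) for b in coords] for a in coords]

tour = decode_tour(heatmap)
# Assumed reward signal for an RL update: shorter tours earn higher reward.
reward = -tour_length(coords, tour)
```

Because the same masked decoding routine can be run during training and at test time, the constraint that each city is visited exactly once holds in both phases, which is the kind of consistency the abstract refers to.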