强化学习
计算机科学
车辆路径问题
指针(用户界面)
比例(比率)
人工智能
算法
布线(电子设计自动化)
数学优化
数学
计算机网络
物理
量子力学
作者
Mingyang Liu,Zheng Wang,Juntao Li
摘要
Deep Reinforcement Learning (DRL) has been successful applied to a number of fields. In recent years, many scholars have used the DRL algorithms to solve a classic combinatorial optimization problem, i.e. Vehicle Routing Problem (VRP). The scale of the problems that are solved in the literatures is small, thus it is difficult to apply the algorithm into practice where there are many large-scale instances. To solve large-scale VRPs by using DRL, this paper proposes a pre-training mechanism for online shared networks. The graph pointer network under the multi-head attention mechanism is trained in the dual-network reinforcement learning mode. The trained model can be applied to large-scale VRP with 100/300/500 customers within a certain time. The experiments reveal that our algorithm can obtain good solutions in terms of solution quality and offline solution efficiency.
科研通智能强力驱动
Strongly Powered by AbleSci AI