强化学习
计算机科学
最优化问题
组合优化
人工智能
旅行商问题
车辆路径问题
二次分配问题
数学优化
理论计算机科学
数学
算法
布线(电子设计自动化)
计算机网络
作者
Qi Wang,Kenneth H. Lai,Chun-Lei Tang
标识
DOI:10.1016/j.ins.2022.11.073
摘要
Combinatorial optimization, such as vehicle routing and traveling salesman problems for graphs, is NP-hard and has been studied for decades. Many methods have been proposed for its possible solution, including, but not limited to, exact algorithms, approximate algorithms, heuristic algorithms, and solution solvers. However, these methods cannot learn the problem’s internal structure nor generalize to similar or larger-scale problems. Recently, deep reinforcement learning has been applied to combinatorial optimization and has achieved convincing results. Nevertheless, the challenge of effective integration and training improvement still exists. In this study, we propose a novel framework (BDRL) that combines BERT (Bidirectional Encoder Representations from Transformers) and deep reinforcement learning to tackle combinatorial optimization over graphs by treating general optimization problems as data points under an identified data distribution. We first improved the transformer encoder of BERT to embed the combinatorial optimization graph effectively. By employing contrastive objectives, we extend BERT-like training to reinforcement learning and acquire self-attention-consistent representations. Next, we used hierarchical reinforcement learning to pre-train our model; that is, to train and fine-tune the model through an iterative process to make it more suitable for a specific combinatorial optimization problem. The results demonstrate our proposed framework’s generalization ability, efficiency, and effectiveness in multiple tasks.
科研通智能强力驱动
Strongly Powered by AbleSci AI