计算机科学
启发式
嵌入
组合优化
贪婪算法
顶点覆盖
最优化问题
强化学习
利用
数学优化
旅行商问题
图形
算法
理论计算机科学
人工智能
数学
计算机安全
作者
Hanjun Dai,Elias B. Khalil,Yuyu Zhang,Bistra Dilkina,Le Song
出处
期刊:Cornell University - arXiv
日期:2017-04-05
被引量:969
标识
DOI:10.48550/arxiv.1704.01665
摘要
The design of good heuristics or approximation algorithms for NP-hard combinatorial optimization problems often requires significant specialized knowledge and trial-and-error. Can we automate this challenging, tedious process, and learn the algorithms instead? In many real-world applications, it is typically the case that the same optimization problem is solved again and again on a regular basis, maintaining the same problem structure but differing in the data. This provides an opportunity for learning heuristic algorithms that exploit the structure of such recurring problems. In this paper, we propose a unique combination of reinforcement learning and graph embedding to address this challenge. The learned greedy policy behaves like a meta-algorithm that incrementally constructs a solution, and the action is determined by the output of a graph embedding network capturing the current state of the solution. We show that our framework can be applied to a diverse range of optimization problems over graphs, and learns effective algorithms for the Minimum Vertex Cover, Maximum Cut and Traveling Salesman problems.
科研通智能强力驱动
Strongly Powered by AbleSci AI