Improving Variable Orderings of Approximate Decision Diagrams Using Reinforcement Learning

计算机科学 强化学习 启发式 可扩展性 数学优化 放松(心理学) 公制(单位) 集合(抽象数据类型) 人工智能 线性规划松弛 分析 线性规划 跳跃式监视 变量(数学) 机器学习 算法 数学 数据挖掘 心理学 社会心理学 数学分析 运营管理 数据库 经济 程序设计语言
作者
Quentin Cappart,David Bergman,Louis-Martin Rousseau,Isabeau Prémont‐Schwarz,Augustin Parjadis
出处
期刊:Informs Journal on Computing 卷期号:34 (5): 2552-2570 被引量:10
标识
DOI:10.1287/ijoc.2022.1194
摘要

Prescriptive analytics provides organizations with scalable solutions for large-scale, automated decision making. At the core of prescriptive analytics methodology is optimization, a field devoted to the study of algorithms that solve complex decision-making problems. Optimization algorithms rely heavily on generic methods for identifying tight bounds, which provide both solutions to problems and optimality guarantees. In the last decade, decision diagrams (DDs) have demonstrated significant advantages in obtaining bounds compared with the standard linear relaxation commonly used by commercial solvers. However, the quality of the bounds computed by DDs depends heavily on the variable ordering chosen for the construction. Besides, the problem of finding an ordering that optimizes a given metric is generally NP-hard. This paper studies how machine learning, specifically deep reinforcement learning (DRL), can be used to improve bounds provided by DDs, in particular through learning a good variable ordering. The introduced DRL models improve primal and dual bounds, even over standard linear programming relaxations, and are integrated in a full-fledged branch-and-bound algorithm. This paper, therefore, provides a novel mechanism for utilizing machine learning to tighten bounds, adding to recent research on using machine learning to obtain high-quality heuristic solutions and, for the first time, using machine learning to improve relaxation bounds through a generic bounding method. We apply the methods on a classic optimization problem, the maximum independent set, and demonstrate through computational testing that optimization bounds can be significantly improved through DRL. We provide the code to replicate the results obtained on the maximum independent set. Summary of Contribution: This paper studies the use of reinforcement learning to compute a variable ordering of decision diagram-based approximations for discrete optimization problems. This is among the first works to propose the use of machine learning to improve upon generic bounding methods for discrete optimization problems, thereby establishing a critical bridge between optimization and learning.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
1秒前
南极熊完成签到 ,获得积分10
2秒前
天天快乐应助斯文谷秋采纳,获得10
3秒前
农大彭于晏完成签到,获得积分10
3秒前
suntee发布了新的文献求助10
3秒前
charles发布了新的文献求助10
4秒前
小蘑菇应助Feng5945采纳,获得10
5秒前
wyz完成签到,获得积分10
5秒前
angew2000完成签到,获得积分10
5秒前
7秒前
wuhuhuhu完成签到,获得积分10
7秒前
7秒前
思源应助GOUGOU2022采纳,获得10
7秒前
8秒前
suntee完成签到,获得积分10
8秒前
8秒前
Akim应助小风吹着采纳,获得10
9秒前
依米zhang完成签到,获得积分10
10秒前
10秒前
漂亮的冰菱完成签到,获得积分20
11秒前
12秒前
青青子衿完成签到 ,获得积分10
13秒前
13秒前
LZN发布了新的文献求助10
14秒前
xcky0917发布了新的文献求助10
14秒前
14秒前
14秒前
无花果应助kn采纳,获得10
15秒前
15秒前
蓝色海完成签到,获得积分10
15秒前
小鹿儿完成签到,获得积分10
16秒前
16秒前
xukai发布了新的文献求助10
16秒前
zhgj发布了新的文献求助10
17秒前
18秒前
danmoyjj应助顺利洋葱采纳,获得30
18秒前
InfoNinja应助窦天蓝采纳,获得30
18秒前
涂涂完成签到 ,获得积分10
18秒前
iuhgnor发布了新的文献求助10
19秒前
20秒前
高分求助中
Impact of Mitophagy-Related Genes on the Diagnosis and Development of Esophageal Squamous Cell Carcinoma via Single-Cell RNA-seq Analysis and Machine Learning Algorithms 2000
Evolution 1100
How to Create Beauty: De Lairesse on the Theory and Practice of Making Art 1000
Gerard de Lairesse : an artist between stage and studio 670
CLSI EP47 Evaluation of Reagent Carryover Effects on Test Results, 1st Edition 550
Assessment of Ultrasonographic Measurement of Inferior Vena Cava Collapsibility Index in The Prediction of Hypotension Associated with Tourniquet Release in Total Knee Replacement Surgeries under Spinal Anesthesia 500
Selecting and Specifying Concrete Surface Preparation for Sealers, Coatings, Polymer Overlays, and Concrete Repair 500
热门求助领域 (近24小时)
化学 医学 生物 材料科学 工程类 有机化学 生物化学 物理 内科学 纳米技术 计算机科学 化学工程 复合材料 基因 遗传学 催化作用 物理化学 免疫学 量子力学 细胞生物学
热门帖子
关注 科研通微信公众号,转发送积分 2981602
求助须知:如何正确求助?哪些是违规求助? 2642872
关于积分的说明 7132071
捐赠科研通 2276270
什么是DOI,文献DOI怎么找? 1207460
版权声明 592084
科研通“疑难数据库(出版商)”最低求助积分说明 589858