A deep reinforcement learning approach for solving the Traveling Salesman Problem with Drone

无人机 旅行商问题 强化学习 布线(电子设计自动化) 车辆路径问题 计算机科学 节点(物理) 2-选项 人工智能 数学优化 工程类 数学 生物 计算机网络 算法 结构工程 遗传学
作者
Aigerim Bogyrbayeva,Taehyun Yoon,Hanbum Ko,Sungbin Lim,Hyokun Yun,Changhyun Kwon
出处
期刊:Transportation Research Part C-emerging Technologies [Elsevier BV]
卷期号:148: 103981-103981 被引量:74
标识
DOI:10.1016/j.trc.2022.103981
摘要

Reinforcement learning has recently shown promise in learning quality solutions in many combinatorial optimization problems. In particular, the attention-based encoder-decoder models show high effectiveness on various routing problems, including the Traveling Salesman Problem (TSP). Unfortunately, they perform poorly for the TSP with Drone (TSP-D), requiring routing a heterogeneous fleet of vehicles in coordination—a truck and a drone. In TSP-D, the two vehicles are moving in tandem and may need to wait at a node for the other vehicle to join. State-less attention-based decoder fails to make such coordination between vehicles. We propose a hybrid model that uses an attention encoder and a Long Short-Term Memory (LSTM) network decoder, in which the decoder’s hidden state can represent the sequence of actions made. We empirically demonstrate that such a hybrid model improves upon a purely attention-based model for both solution quality and computational efficiency. Our experiments on the min-max Capacitated Vehicle Routing Problem (mmCVRP) also confirm that the hybrid model is more suitable for the coordinated routing of multiple vehicles than the attention-based model. The proposed model demonstrates comparable results as the operations research baseline methods.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
斯文的老虎完成签到,获得积分10
2秒前
2秒前
3秒前
同学甲发布了新的文献求助10
4秒前
yangminmin完成签到,获得积分20
4秒前
8秒前
10秒前
难过盼海完成签到,获得积分10
11秒前
小二郎应助琉光如喻采纳,获得10
12秒前
伯赏满天发布了新的文献求助10
12秒前
GRJ发布了新的文献求助10
13秒前
jzh完成签到,获得积分20
13秒前
14秒前
timumrxzz发布了新的文献求助10
14秒前
发一区完成签到,获得积分10
14秒前
小肆完成签到 ,获得积分10
15秒前
Orange应助sadsa采纳,获得10
17秒前
17秒前
18秒前
specium完成签到,获得积分10
18秒前
19秒前
20秒前
21秒前
小太阳在营业应助zw采纳,获得10
21秒前
小鱼完成签到,获得积分10
22秒前
8R60d8应助科研通管家采纳,获得10
23秒前
研友_VZG7GZ应助科研通管家采纳,获得10
23秒前
所所应助科研通管家采纳,获得10
23秒前
orixero应助科研通管家采纳,获得10
23秒前
Hello应助科研通管家采纳,获得10
23秒前
在水一方应助科研通管家采纳,获得10
23秒前
8R60d8应助科研通管家采纳,获得10
23秒前
今后应助科研通管家采纳,获得10
23秒前
8R60d8应助科研通管家采纳,获得10
23秒前
乐乐应助科研通管家采纳,获得10
23秒前
传奇3应助科研通管家采纳,获得10
23秒前
24秒前
24秒前
24秒前
deng发布了新的文献求助10
25秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
The impact of workplace variables on juvenile probation officers’ job satisfaction 1000
When the badge of honor holds no meaning anymore 1000
HANDBOOK OF CHEMISTRY AND PHYSICS 106th edition 1000
ASPEN Adult Nutrition Support Core Curriculum, Fourth Edition 1000
AnnualResearch andConsultation Report of Panorama survey and Investment strategy onChinaIndustry 1000
Continuing Syntax 1000
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6279766
求助须知:如何正确求助?哪些是违规求助? 8098830
关于积分的说明 16931919
捐赠科研通 5347615
什么是DOI,文献DOI怎么找? 2842714
邀请新用户注册赠送积分活动 1820069
关于科研通互助平台的介绍 1677126