Dynamic Parallel Machine Scheduling With Deep Q-Network

符号 强化学习 计算机科学 调度(生产过程) 作业车间调度 人工智能 马尔可夫决策过程 数学优化 马尔可夫过程 数学 算术 地铁列车时刻表 统计 操作系统
作者
Chien‐Liang Liu,Chun-Jan Tseng,Tzu‐Hsuan Huang,Jia‐Hong Wang
出处
期刊:IEEE transactions on systems, man, and cybernetics [Institute of Electrical and Electronics Engineers]
卷期号:53 (11): 6792-6804 被引量:7
标识
DOI:10.1109/tsmc.2023.3289322
摘要

Parallel machine scheduling (PMS) is a common setting in many manufacturing facilities, in which each job is allowed to be processed on one of the machines of the same type. It involves scheduling $n$ jobs on $m$ machines to minimize certain objective functions. For preemptive scheduling, most problems are not only NP-hard but also difficult in practice. Moreover, many unexpected events, such as machine failure and requirement change, are inevitable in the practical production process, meaning that rescheduling is required for static scheduling methods. Deep reinforcement learning (DRL), which combines deep learning and reinforcement learning, has achieved promising results in several domains and has shown the potential to solve large Markov decision process (MDP) optimization tasks. Moreover, PMS problems can be formulated as an MDP problem, inspiring us to devise a DRL method to deal with PMS problems in a dynamic environment. We develop a novel DRL-based PMS method, called DPMS, in which the developed model considers the characteristics of PMS to design states and the reward. The actions involve dispatching rules, so DPMS can be considered a meta-dispatching-rule system that can efficiently select a sequence of dispatching rules based on the current environment or unexpected events. The experimental results demonstrate that DPMS can yield promising results in a dynamic environment by learning from the interactions between the agent and the environment. Furthermore, we conduct extensive experiments to analyze DPMS in the context of developing a DRL to deal with dynamic PMS problems.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
xxx1234发布了新的文献求助10
1秒前
luym完成签到,获得积分10
1秒前
yujingyang完成签到,获得积分10
2秒前
4秒前
yujingyang发布了新的文献求助30
5秒前
6秒前
Ava应助好运莲莲采纳,获得10
6秒前
6秒前
共享精神应助归仔采纳,获得10
7秒前
自由归尘发布了新的文献求助20
9秒前
10秒前
岛屿域完成签到,获得积分10
11秒前
lmd完成签到,获得积分10
13秒前
base发布了新的文献求助10
14秒前
15秒前
16秒前
上官小玉发布了新的文献求助10
19秒前
Hello应助xiaodq采纳,获得10
19秒前
猪头发布了新的文献求助10
20秒前
zhao完成签到,获得积分20
21秒前
午后狂睡发布了新的文献求助10
21秒前
静子完成签到,获得积分10
21秒前
龙傲天发布了新的文献求助10
21秒前
zong发布了新的文献求助10
21秒前
Singularity应助小欧采纳,获得10
22秒前
24秒前
24秒前
25秒前
tian发布了新的文献求助10
26秒前
27秒前
KK完成签到 ,获得积分10
27秒前
xxx1234完成签到,获得积分10
27秒前
dsaifjs完成签到,获得积分10
27秒前
27秒前
28秒前
甜甜玫瑰应助北海未暖采纳,获得20
29秒前
远了个方发布了新的文献求助10
29秒前
29秒前
zong完成签到,获得积分10
30秒前
琉琉硫关注了科研通微信公众号
31秒前
高分求助中
Evolution 2024
中国国际图书贸易总公司40周年纪念文集: 回忆录 2000
Impact of Mitophagy-Related Genes on the Diagnosis and Development of Esophageal Squamous Cell Carcinoma via Single-Cell RNA-seq Analysis and Machine Learning Algorithms 2000
Die Elektra-Partitur von Richard Strauss : ein Lehrbuch für die Technik der dramatischen Komposition 1000
How to Create Beauty: De Lairesse on the Theory and Practice of Making Art 1000
Gerard de Lairesse : an artist between stage and studio 670
Formation of interface waves in dependence of the explosive welding parameters 550
热门求助领域 (近24小时)
化学 医学 生物 材料科学 工程类 有机化学 生物化学 物理 内科学 纳米技术 计算机科学 化学工程 复合材料 基因 遗传学 催化作用 物理化学 免疫学 量子力学 细胞生物学
热门帖子
关注 科研通微信公众号,转发送积分 3003888
求助须知:如何正确求助?哪些是违规求助? 2663140
关于积分的说明 7216546
捐赠科研通 2299108
什么是DOI,文献DOI怎么找? 1219395
科研通“疑难数据库(出版商)”最低求助积分说明 594430
版权声明 593089