Dynamic Parallel Machine Scheduling With Deep Q-Network

符号 强化学习 计算机科学 调度(生产过程) 作业车间调度 人工智能 马尔可夫决策过程 数学优化 马尔可夫过程 数学 算术 地铁列车时刻表 统计 操作系统
作者
Chien‐Liang Liu,Chun-Jan Tseng,Tzu‐Hsuan Huang,Jhih-Wun Wang
出处
期刊:IEEE transactions on systems, man, and cybernetics [Institute of Electrical and Electronics Engineers]
卷期号:53 (11): 6792-6804 被引量:27
标识
DOI:10.1109/tsmc.2023.3289322
摘要

Parallel machine scheduling (PMS) is a common setting in many manufacturing facilities, in which each job is allowed to be processed on one of the machines of the same type. It involves scheduling $n$ jobs on $m$ machines to minimize certain objective functions. For preemptive scheduling, most problems are not only NP-hard but also difficult in practice. Moreover, many unexpected events, such as machine failure and requirement change, are inevitable in the practical production process, meaning that rescheduling is required for static scheduling methods. Deep reinforcement learning (DRL), which combines deep learning and reinforcement learning, has achieved promising results in several domains and has shown the potential to solve large Markov decision process (MDP) optimization tasks. Moreover, PMS problems can be formulated as an MDP problem, inspiring us to devise a DRL method to deal with PMS problems in a dynamic environment. We develop a novel DRL-based PMS method, called DPMS, in which the developed model considers the characteristics of PMS to design states and the reward. The actions involve dispatching rules, so DPMS can be considered a meta-dispatching-rule system that can efficiently select a sequence of dispatching rules based on the current environment or unexpected events. The experimental results demonstrate that DPMS can yield promising results in a dynamic environment by learning from the interactions between the agent and the environment. Furthermore, we conduct extensive experiments to analyze DPMS in the context of developing a DRL to deal with dynamic PMS problems.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
落后安筠完成签到 ,获得积分10
1秒前
开朗清涟完成签到,获得积分10
3秒前
Riverchase应助欢城采纳,获得10
5秒前
小盘子完成签到,获得积分10
6秒前
憨憨的小于完成签到,获得积分10
7秒前
幽默服饰完成签到,获得积分10
13秒前
怕黑的寻菱完成签到,获得积分10
15秒前
lalala发布了新的文献求助10
18秒前
汉堡包应助zhouziliang采纳,获得10
20秒前
田国兵完成签到,获得积分10
20秒前
阿姨洗铁路完成签到 ,获得积分10
22秒前
微雨若,,完成签到 ,获得积分10
22秒前
松松发布了新的文献求助20
23秒前
gypsy_scum完成签到 ,获得积分10
28秒前
懒懒羊完成签到,获得积分10
28秒前
30秒前
香蕉觅云应助王哪跑12采纳,获得10
30秒前
情怀应助向前采纳,获得10
31秒前
31秒前
lalala发布了新的文献求助10
32秒前
zhouziliang发布了新的文献求助10
34秒前
数乱了梨花完成签到 ,获得积分0
36秒前
嗯哼发布了新的文献求助10
37秒前
39秒前
精明的听寒完成签到,获得积分10
42秒前
alsen完成签到,获得积分10
43秒前
王哪跑12发布了新的文献求助10
44秒前
嗯哼完成签到,获得积分10
45秒前
46秒前
xia xianxin发布了新的文献求助10
47秒前
布吉岛呀完成签到 ,获得积分10
51秒前
向前发布了新的文献求助10
51秒前
tongke完成签到,获得积分10
53秒前
54秒前
55秒前
56秒前
GOAT应助xia xianxin采纳,获得10
57秒前
干净的琦应助xia xianxin采纳,获得10
57秒前
woxinyouyou发布了新的文献求助10
1分钟前
1分钟前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
PowerCascade: A Synthetic Dataset for Cascading Failure Analysis in Power Systems 2000
Various Faces of Animal Metaphor in English and Polish 800
Signals, Systems, and Signal Processing 610
Photodetectors: From Ultraviolet to Infrared 500
On the Dragon Seas, a sailor's adventures in the far east 500
Yangtze Reminiscences. Some Notes And Recollections Of Service With The China Navigation Company Ltd., 1925-1939 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6353208
求助须知:如何正确求助?哪些是违规求助? 8168055
关于积分的说明 17191634
捐赠科研通 5409260
什么是DOI,文献DOI怎么找? 2863646
邀请新用户注册赠送积分活动 1840984
关于科研通互助平台的介绍 1689834