计算机科学
强化学习
流水车间调度
启发式
作业车间调度
调度(生产过程)
排列(音乐)
人工神经网络
人工智能
算法
数学优化
最优化问题
地铁列车时刻表
数学
操作系统
物理
声学
作者
Zixiao Pan,Ling Wang,Jing-jing Wang,Jiawen Lu
出处
期刊:IEEE transactions on emerging topics in computational intelligence
[Institute of Electrical and Electronics Engineers]
日期:2023-08-01
卷期号:7 (4): 983-994
被引量:16
标识
DOI:10.1109/tetci.2021.3098354
摘要
As a new analogy paradigm of human learning process, reinforcement learning (RL) has become an emerging topic in computational intelligence (CI). The synergy between the RL and CI is an emerging way to develop efficient solution algorithms for solving complex combinatorial optimization (CO) problems like machine scheduling problem. In this paper, we proposed an efficient optimization algorithm based on Deep RL for solving permutation flow-shop scheduling problem (PFSP) to minimize the maximum completion time. Firstly, a new deep neural network (PFSPNet) is designed for the PFSP to achieve the end-to-end output without limitation of problem sizes. Secondly, an actor-critic method of RL is used to train the PFSPNet without depending on the collection of high-quality labelled data. Thirdly, an improvement strategy is designed to refine the solution provided by the PFSPNet. Simulation results and statistical comparison show that the proposed optimization algorithm based on deep RL can obtain better results than the existing heuristics in similar computational time for solving the PFSP.
科研通智能强力驱动
Strongly Powered by AbleSci AI