概括性
强化学习
计算机科学
流水车间调度
作业车间调度
动态优先级调度
单调速率调度
调度(生产过程)
公平份额计划
端到端原则
两级调度
分布式计算
数学优化
人工智能
服务质量
计算机网络
数学
心理学
布线(电子设计自动化)
心理治疗师
作者
Linlin Zhao,Weiming Shen,Chunjiang Zhang,Kunkun Peng
标识
DOI:10.1109/cscwd54268.2022.9776116
摘要
Job shop scheduling problem (JSSP) is a typical scheduling problem in manufacturing. Traditional scheduling methods fail to guarantee both efficiency and quality in complex and changeable production environments. This paper proposes an end-to-end deep reinforcement learning (DRL) method to address the JSSP. In order to improve the quality of solutions, a network model based on transformer and attention mechanism is constructed as the actor to enable a DRL agent to search in its solution space. The Proximal policy optimization (PPO) algorithm is utilized to train the network model to learn optimal scheduling policies. The trained model generates sequential decision actions as the scheduling solution. Numerical experiment results demonstrate the superiority and generality of the proposed method compared with other three classic heuristic rules.
科研通智能强力驱动
Strongly Powered by AbleSci AI