强化学习
半导体器件制造
薄脆饼
调度(生产过程)
计算机科学
模糊逻辑
地铁列车时刻表
工程类
工业工程
可靠性工程
数学优化
人工智能
数学
操作系统
电气工程
作者
Junliang Wang,Pengjie Gao,Peng Zheng,Jie Zhang,W.H. Ip
标识
DOI:10.1016/j.jmsy.2021.08.008
摘要
Scheduling semiconductor wafer manufacturing systems has been viewed as one of the most challenging optimization problems owing to the complicated constraints, and dynamic system environment. This paper proposes a fuzzy hierarchical reinforcement learning (FHRL) approach to schedule a SWFS, which controls the cycle time (CT) of each wafer lot to improve on-time delivery by adjusting the priority of each wafer lot. To cope with the layer correlation and wafer correlation of CT due to the re-entrant process constraint, a hierarchical model is presented with a recurrent reinforcement learning (RL) unit in each layer to control the corresponding sub-CT of each integrated circuit layer. In each RL unit, a fuzzy reward calculator is designed to reduce the impact of uncertainty of expected finishing time caused by the rematching of a lot to a delivery batch. The results demonstrate that the mean deviation (MD) between the actual and expected completion time of wafer lots under the scheduling of the FHRL approach is only about 30 % of the compared methods in the whole SWFS.
科研通智能强力驱动
Strongly Powered by AbleSci AI