强化学习
作业车间调度
计算机科学
流水车间调度
调度(生产过程)
地铁列车时刻表
马尔可夫链
马尔可夫决策过程
工作车间
数学优化
马尔可夫过程
实时计算
人工智能
机器学习
数学
操作系统
统计
作者
Tianfang Xue,Peng Zeng,Haibin Yu
标识
DOI:10.1109/icit.2018.8352413
摘要
This paper addresses a multi-AGV flow-shop scheduling problem with a reinforcement learning method. Each AGV equipped with a robotic manipulator, operates on the fixed tracks, transporting semi-finished products between successive machines. The objectives dealt with here is to obtain a AGV schedule that minimize the average job delay and total makespan. After formulating such schedule problem as a Markov problem by defining state features, actions space and reward function, a new scheduling method is proposed, based on reinforcement learning. In this new method AGVs share full information on each machine's instant state and job being executed, making decisions thorough understanding of the entire flow shop. Simulation results demonstrate that this new method learns optimal or near-optimal solution from the past experience and provides better performance than multi-agent scheduling method in a dynamic environment.
科研通智能强力驱动
Strongly Powered by AbleSci AI