强化学习
序列(生物学)
计算机科学
平面图(考古学)
人工神经网络
价值(数学)
人工智能
机器学习
工程类
历史
遗传学
考古
生物
作者
Minghui Zhao,Xian Guo,Xuebo Zhang,Yongchun Fang,Yongsheng Ou
出处
期刊:Assembly Automation
[Emerald (MCB UP)]
日期:2019-08-12
卷期号:40 (1): 65-75
被引量:16
标识
DOI:10.1108/aa-11-2018-0211
摘要
Purpose This paper aims to automatically plan sequence for complex assembly products and improve assembly efficiency. Design/methodology/approach An assembly sequence planning system for workpieces (ASPW) based on deep reinforcement learning is proposed in this paper. However, there exist enormous challenges for using DRL to this problem due to the sparse reward and the lack of training environment. In this paper, a novel ASPW-DQN algorithm is proposed and a training platform is built to overcome these challenges. Findings The system can get a good decision-making result and a generalized model suitable for other assembly problems. The experiments conducted in Gazebo show good results and great potential of this approach. Originality/value The proposed ASPW-DQN unites the curriculum learning and parameter transfer, which can avoid the explosive growth of assembly relations and improve system efficiency. It is combined with realistic physics simulation engine Gazebo to provide required training environment. Additionally with the effect of deep neural networks, the result can be easily applied to other similar tasks.
科研通智能强力驱动
Strongly Powered by AbleSci AI