趋同(经济学)
解耦(概率)
计算机科学
算法
强化学习
任务(项目管理)
理论(学习稳定性)
差异(会计)
梯度下降
人工智能
人工神经网络
机器学习
工程类
控制工程
会计
业务
经济
系统工程
经济增长
作者
Huang Wang,Fajie Duan,ZHOU WEITI
标识
DOI:10.1080/00405000.2020.1809918
摘要
To solve the problem of fabric defect detection under complex illumination conditions, the Recurrent Attention Model (RAM) which is insensitive to illumination and noise differences has been introduced. However, the policy gradient algorithm in the RAM has some problems, such as the difficulty of convergence and the inefficiency of the algorithm due to the shortcomings of round updating. In this paper, the Deep Deterministic Policy Gradient- Recurrent Attention Model (DDPG-RAM) algorithm is proposed to solve the problems of policy gradient algorithm. Although the decoupling of the reinforcement learning task and classification task will lead to the inconsistency of the data, the gradient variance will be smaller, and the convergence speed and stability will be accelerated. Experiment results show that fabric defects can be detected by the proposed DDPG-RAM algorithm under complex illumination conditions. Compared with RAM and the Convolutional Neural Network (CNN), the accuracy of the decoupled algorithm is 95.24%, and the convergence speed is 50% faster than that of the RAM.
科研通智能强力驱动
Strongly Powered by AbleSci AI