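Reinforcement learning
Fault (geology)
Artificial intelligence
Computer science
Divergence (linguistics)
Function (biology)
State (computer science)
Machine learning
Algorithm
Linguistics
Evolutionary biology
Biology
Geology
Philosophy
Seismology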
Authors
Zhe Cheng,Wei Lei,Junsheng Cheng,Niaoqing Hu
Source
Journal: Mechanisms and Machine Science
Date: 2023-01-01
Pages: 615-627
Identifier
DOI:10.1007/978-3-031-26193-0_55
Abstract
Because rotating machinery operates in a healthy state most of the time and sufficient fault data are difficult to obtain, historical data are heavily skewed toward the healthy state, which degrades the accuracy of intelligent fault diagnosis methods based on conventional deep learning (DL). To improve the performance of DL algorithms under imbalanced samples, this paper proposes a deep reinforcement learning algorithm built on an actor-critic architecture that combines reinforcement learning (RL) and DL. It uses DL as the basic learner to perceive input information and RL as the decision maker to determine the health status or fault type of the rotating machinery. In the proposed algorithm, the reward function in the actor module is improved: it increases the reward when the agent correctly recognizes the fault class and encourages the agent to pay attention to minority fault samples. Jensen–Shannon (JS) divergence is used to measure the distance between the agent's output action distribution and the target distribution, which relieves the reward sparsity issue in the initial training stage. In addition, an improved exploration strategy is designed in which the greedy factor decreases with epochs, so that the agent explores the external environment as much as possible in the initial training stage. Finally, an advanced weighted regression is introduced as the loss function to ensure that the agent updates in a beneficial direction. Experiments on the PHM2009 gearbox challenge data demonstrate that the improved actor-critic framework helps guide DL-based intelligent diagnosis models to better handle imbalanced data.
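The abstract names two ingredients that are easy to illustrate in isolation: a JS-divergence term comparing the agent's action distribution with a target distribution, and a greedy factor that shrinks over epochs. The following is a minimal sketch only, not the paper's implementation. The function names, the linear decay schedule, the default values `eps_start` and `eps_end`, the one-hot target, and the per-class weighting in `shaped_reward` are all assumptions made for illustration, and the greedy factor is assumed here to be the probability of taking a random exploratory action.

```python
import math

def kl_divergence(p, q):
    """KL(p || q) in nats; terms with p_i == 0 contribute zero."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

def js_divergence(p, q):
    """Jensen-Shannon divergence between two discrete distributions.

    Symmetric and bounded in [0, ln 2] when natural logarithms are used.
    """
    m = [(pi + qi) / 2 for pi, qi in zip(p, q)]
    return 0.5 * kl_divergence(p, m) + 0.5 * kl_divergence(q, m)

def greedy_factor(epoch, total_epochs, eps_start=1.0, eps_end=0.05):
    """Hypothetical linear decay of the exploration probability.

    Assumption: the schedule is linear; the abstract only says the
    factor decreases with epochs.
    """
    frac = min(max(epoch / total_epochs, 0.0), 1.0)
    return eps_start + (eps_end - eps_start) * frac

def shaped_reward(action_probs, true_label, class_weights):
    """Hypothetical dense reward: larger when the action distribution is
    close (in JS divergence) to a one-hot target, scaled by a per-class
    weight so that minority fault classes can be weighted more heavily.
    """
    target = [1.0 if i == true_label else 0.0 for i in range(len(action_probs))]
    closeness = 1.0 - js_divergence(action_probs, target) / math.log(2)
    return class_weights[true_label] * closeness
```

Because the JS term varies smoothly with the action distribution, a reward of this kind is non-zero even when the most probable action is wrong, which is one way a divergence-based signal can ease reward sparsity early in training.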