惩罚(心理学)
计算机科学
期限(时间)
短时记忆
循环神经网络
人工神经网络
人工智能
认知心理学
心理学
发展心理学
量子力学
物理
作者
Jiangjiang Liu,Biao Luo,Pengfei Yan,Ding Wang,Derong Liu
标识
DOI:10.1109/yac.2017.7967428
摘要
Recurrent neural networks and their variants have received huge success in many difficult tasks, such as handwriting recognition and generation, natural language processing, acoustic modeling of speech, and so on. As a kind of recurrent neural network architectures, the long short-term memory (LSTM) has attracted great attention. Most research works focus on its structures, training algorithms and topology structures. As an improvement to the structure of LSTM, a reward/punishment strategy is developed for LSTM in this paper, which we call RP-LSTM. In RP-LSTM, a reward/punishment (RP) strategy is proposed to evaluate its memory cells' memorization such that it improves its efficiency by forgetting more reasonably. Analysis of properties of the developed RP-LSTM is conducted from the neuroscience aspect. To test the performance of the developed RP-LSTM, comparative simulation studies are conducted on three structures, i.e., LSTM, LSTM with forget gate (LSTM-FG) and RP-LSTM. Simulation results on sentiment analysis model and sequence to sequence model demonstrate that RP-LSTM achieves better performance.
科研通智能强力驱动
Strongly Powered by AbleSci AI