强化学习
计算机科学
钢筋
人工智能
航程(航空)
国家(计算机科学)
机器学习
心理学
工程类
社会心理学
算法
航空航天工程
作者
Anthony Manchin,Ehsan Abbasnejad,Anton van den Hengel
出处
期刊:Communications in computer and information science
日期:2019-01-01
卷期号:: 223-230
被引量:36
标识
DOI:10.1007/978-3-030-36802-9_25
摘要
Attention models have had a significant positive impact on deep learning across a range of tasks. However previous attempts at integrating attention with reinforcement learning have failed to produce significant improvements. Unlike the selective attention models used in previous attempts, which constrain the attention via preconceived notions of importance, our implementation utilises the Markovian properties inherent in the state input. We propose the first combination of self attention and reinforcement learning that is capable of producing significant improvements, including new state of the art results in the Arcade Learning Environment.
科研通智能强力驱动
Strongly Powered by AbleSci AI