强化学习
计算机科学
人工智能
深度学习
领域(数学)
人工神经网络
机器人学
异步通信
机器学习
机器人
电信
数学
纯数学
作者
Kai Arulkumaran,Marc Peter Deisenroth,Miles Brundage,Anil A. Bharath
出处
期刊:IEEE Signal Processing Magazine
[Institute of Electrical and Electronics Engineers]
日期:2017-11-01
卷期号:34 (6): 26-38
被引量:2901
标识
DOI:10.1109/msp.2017.2743240
摘要
Deep reinforcement learning is poised to revolutionise the field of AI and represents a step towards building autonomous systems with a higher level understanding of the visual world. Currently, deep learning is enabling reinforcement learning to scale to problems that were previously intractable, such as learning to play video games directly from pixels. Deep reinforcement learning algorithms are also applied to robotics, allowing control policies for robots to be learned directly from camera inputs in the real world. In this survey, we begin with an introduction to the general field of reinforcement learning, then progress to the main streams of value-based and policy-based methods. Our survey will cover central algorithms in deep reinforcement learning, including the deep $Q$-network, trust region policy optimisation, and asynchronous advantage actor-critic. In parallel, we highlight the unique advantages of deep neural networks, focusing on visual understanding via reinforcement learning. To conclude, we describe several current areas of research within the field.
科研通智能强力驱动
Strongly Powered by AbleSci AI