强化学习
计算机科学
人工智能
深度学习
领域(数学)
人工神经网络
机器人学
异步通信
机器学习
机器人
电信
数学
纯数学
作者
Kai Arulkumaran,Marc Peter Deisenroth,Miles Brundage,Anil A. Bharath
标识
DOI:10.1109/msp.2017.2743240
摘要
Deep reinforcement learning (DRL) is poised to revolutionize the field of artificial intelligence (AI) and represents a step toward building autonomous systems with a higher-level understanding of the visual world. Currently, deep learning is enabling reinforcement learning (RL) to scale to problems that were previously intractable, such as learning to play video games directly from pixels. DRL algorithms are also applied to robotics, allowing control policies for robots to be learned directly from camera inputs in the real world. In this survey, we begin with an introduction to the general field of RL, then progress to the main streams of value-based and policy-based methods. Our survey will cover central algorithms in deep RL, including the deep Q-network (DQN), trust region policy optimization (TRPO), and asynchronous advantage actor critic. In parallel, we highlight the unique advantages of deep neural networks, focusing on visual understanding via RL. To conclude, we describe several current areas of research within the field.
科研通智能强力驱动
Strongly Powered by AbleSci AI