强化学习
计算机科学
基线(sea)
人工智能
国家(计算机科学)
业余
机制(生物学)
动作(物理)
方案(数学)
钢筋
人机交互
机器学习
工程类
算法
政治学
法学
数学
哲学
数学分析
地质学
物理
认识论
海洋学
量子力学
结构工程
作者
Xiangxiang Shen,Chuanhuan Yin,Xinwen Hou
标识
DOI:10.1145/3325730.3325743
摘要
Reinforcement learning is concerned with how software agents ought to take actions according to the state of the environment so as to maximize some notion of cumulative reward. Therefore, in-depth study and mining of the state of the environment will be more conducive to the agent to make better decisions. Motivated by the advantages of self-attention mechanism in machine translation, this paper presents a new scheme. In this scheme, the state in deep reinforcement learning algorithms can be combined with self-attention mechanism. After that agents will pay more attention to the internal structure of state especially in a complex game environment, like real-time strategy game StarCraft. StarCraft is a huge challenge platform for AI researchers because of its huge state spaces and action spaces. Some baseline agents of reinforcement learning provided by DeepMind for mini-games in StarCraft II have not reached the level of an amateur player. Our agents use fewer features than DeepMind's baseline agents and have made significant improvement.
科研通智能强力驱动
Strongly Powered by AbleSci AI