强化学习
计算机科学
一般化
多样性(控制论)
人工智能
比例(比率)
深度学习
电子游戏
人机交互
多媒体
数学
数学分析
物理
量子力学
作者
Elvis S. Liu,Weifan Li,Yuan Zhou,Hugh Cao,Zhengwen Zeng
标识
DOI:10.1109/cog57401.2023.10333204
摘要
Naruto Mobile is a popular mobile Fighting Game with over 100 million registered players. AI agents are deployed extensively to the game for a wide variety of applications such as level challenges and player training, which require them to fight like humans and imitate strong and weak players. Although deep reinforcement learning is an excellent approach to creating agents with diverse behaviors, it is difficult to apply to massive-scale games like Naruto Mobile which is built on a pool of more than 300 characters that have unique skills, speed, and attack range, as a traditional approach of self-play training at such scale may require a substantial computational cost and training time.In this paper, we present a new AI training approach called Heterogeneous Exploitation Self-Play (HESP) to improve AI agent generalization ability in Naruto Mobile and optimize its massive-scale self-play training so that the computational costs and train time are significantly reduced. The proposed algorithm has already been employed by the development team of Naruto Mobile to create AI agents, which, at the time of writing this paper, have been used in more than 300 million human-AI fighting matches. To the best of our knowledge, this is the first time that deep reinforcement learning has been employed by a commercial fighting game.
科研通智能强力驱动
Strongly Powered by AbleSci AI