计算机科学
强化学习
马尔可夫决策过程
任务(项目管理)
钥匙(锁)
过程(计算)
人工智能
分布式计算
马尔可夫过程
机器学习
计算机安全
数学
统计
操作系统
经济
管理
作者
Xiaohu Zhao,Hanli Jiang,Chenyang An,Ruocheng Wu,Yijun Guo,Daquan Yang
摘要
With the increasing complexity of UAV application scenarios, the performance of a single UAV cannot meet the mission requirements. Many complex tasks need the cooperation of multiple UAVs. How to coordinate UAV resources becomes the key to mission completion. In this paper, a task model including multiple UAVs and unknown obstacles is constructed, and the model is transformed into a Markov decision process (MDP). In addition, considering the influence of strategies among UAVs, a multiagent reinforcement learning algorithm based on SAC algorithm and centralized training and decentralized execution framework, MA-SAC (Multi-Agent Soft Actor-Critic), is proposed to solve the MDP. Simulation results show that the algorithm can effectively deal with the task allocation problem of multiple UAVs in this scenario, and its performance is better than other multiagent reinforcement learning algorithms.
科研通智能强力驱动
Strongly Powered by AbleSci AI