计算机科学
强化学习
避障
趋同(经济学)
一般化
任务(项目管理)
算法
障碍物
人工智能
线路规划
功能(生物学)
数学优化
机器人
移动机器人
数学分析
数学
管理
进化生物学
政治学
法学
经济
生物
经济增长
作者
Yuxiang Zhou,Jiansheng Shu,Xiaolong Zheng,Hui Hao,Huan Song
标识
DOI:10.3389/fnbot.2022.1025817
摘要
With the application and development of UAV technology and navigation and positioning technology, higher requirements are put forward for UAV maneuvering obstacle avoidance ability and real-time route planning. In this paper, for the problem of real-time UAV route planning in the unknown environment, we combine the ideas of artificial potential field method to modify the state observation and reward function, which solves the problem of sparse rewards of reinforcement learning algorithm, improves the convergence speed of the algorithm, and improves the generalization of the algorithm by step-by-step training based on the ideas of curriculum learning and transfer learning according to the difficulty of the task. The simulation results show that the improved SAC algorithm has fast convergence speed, good timeliness and strong generalization, and can better complete the UAV route planning task.
科研通智能强力驱动
Strongly Powered by AbleSci AI