障碍物
强化学习
趋同(经济学)
计算机科学
增强学习
路径(计算)
运动规划
贝尔曼方程
功能(生物学)
领域(数学)
避障
数学优化
算法
人工智能
数学
地理
机器人
考古
移动机器人
进化生物学
生物
程序设计语言
经济增长
纯数学
经济
标识
DOI:10.1007/978-3-031-36014-5_24
摘要
With the broadening of UAV application fields, the working environment of UAVs has become more and more complex. Intensive, dynamic and non-convex are the main characteristics of the obstacle environment under the new demand, and the complex obstacle environment brings great challenges to the working operation and flight of UAVs. This paper puts forward a reinforcement learning algorithm named APF-Q-learning algorithm, which is the combination of the artificial potential field (APF) method and the Q-Learning algorithm, and the reward function is designed to make the value function table converge faster. The simulation results also show that the proposed algorithm can better solve the problems of local optimum and slow convergence of the value function.
科研通智能强力驱动
Strongly Powered by AbleSci AI