避障
控制理论(社会学)
障碍物
弹道
避碰
计算机科学
机器人
移动机器人
数学优化
数学
人工智能
控制(管理)
碰撞
计算机安全
物理
法学
政治学
天文
作者
Shiqi Wang,Yiyi Hu,Z.Y. Liu,Lijun Ma
标识
DOI:10.1016/j.compeleceng.2023.108753
摘要
In this paper, we propose an adaptive obstacle avoidance algorithm based on DDPG (Deep Deterministic Policy Gradient) and DWA (Dynamic-Window Approach) to study the obstacle avoidance problem of robots in complex continuous state space. First, the obstacle avoidance problem is converted into an optimal learning incentive problem, and the self-learning of the obstacle avoidance policy is realized based on DDPG; second, the DWA obstacle avoidance trajectory evaluation function is optimized using the DDPG reward incentive mechanism, and the Experience Replay mechanism; finally, the algorithm model is simulated. The experiments show that the model can significantly circumvent the deficiency of the DWA algorithm in limiting to the optimal local solution in a complex environment and solve the action output problem in the continuous velocity and turning angle value interval of the robot; through the trial and error interaction with the environment and the trajectory evaluation incentive feedback, the obstacle avoidance passing ability of the robot in a complex environment is improved.
科研通智能强力驱动
Strongly Powered by AbleSci AI