强化学习
运动规划
避障
人工智能
计算机科学
控制工程
马尔可夫决策过程
工程类
水下
一般化
移动机器人
马尔可夫过程
机器人
海洋学
地质学
统计
数学
数学分析
作者
Behnaz Hadi,Alireza Khosravi,Pouria Sarhadi
标识
DOI:10.1016/j.apor.2022.103326
摘要
Research into intelligent motion planning methods has been driven by the growing autonomy of autonomous underwater vehicles (AUV) in complex unknown environments. Deep reinforcement learning (DRL) algorithms with actor-critic structures are optimal adaptive solutions that render online solutions for completely unknown systems. The present study proposes an adaptive motion planning and obstacle avoidance technique based on deep reinforcement learning for an AUV. The research employs a twin-delayed deep deterministic policy algorithm, which is suitable for Markov processes with continuous actions. Environmental observations are the vehicle's sensor navigation information. Motion planning is carried out without having any knowledge of the environment. A comprehensive reward function has been developed for control purposes. The proposed system is robust to the disturbances caused by ocean currents. The simulation results show that the motion planning system can precisely guide an AUV with six-degrees-of-freedom dynamics towards the target. In addition, the intelligent agent has appropriate generalization power.
科研通智能强力驱动
Strongly Powered by AbleSci AI