强化学习
计算机科学
扰动(地质)
运动规划
电流(流体)
人工智能
路径(计算)
机器人
计算机网络
海洋学
地质学
古生物学
作者
Jiabao Wen,Huiao Dai,Jingyi He,Lijiao Sun,Liqing Gao
标识
DOI:10.1109/jiot.2024.3384476
摘要
With the development of society and the economy, low-carbon and low-energy means of exploiting marine resources are receiving increasing attention. Autonomous path planning is a fundamental capability for IoT Autonomous Underwater Vehicle (AUV) to carry out ocean exploration tasks. Currently, the main issue lies in the numerous disturbances and uncertainties present in the marine environment during practical applications, which can significantly impact path planning, leading to high energy consumption and carbon emissions. To address this challenge, this paper presents a sustainable reinforcement learning algorithm for handling time-varying current disturbances to achieve low-carbon AUV path planning, which is delineated into three steps. Firstly, a three-dimensional time-varying current environment is established as the environmental framework for reinforcement learning, and the dynamic model of the AUV is formulated. Secondly, to enhance training efficiency and reduce AUV's energy consumption, this paper puts forth the OCDRP (Ocean Current Disturbance Rejection PPO) algorithm, which incorporates tidal current information to enhance the AUV's resilience to time-varying currents. Lastly, expectile regression methods are introduced to facilitate the algorithm's convergence. Experimental results confirm the efficacy of the proposed algorithm and its adaptability to time-varying currents, making it an efficient, adaptable, and low-carbon sustainable path planning approach.
科研通智能强力驱动
Strongly Powered by AbleSci AI