Keywords
Reinforcement learning
Process (computing)
Computer science
Artificial intelligence
Stability (learning theory)
Underwater
Obstacle avoidance
Convergence (economics)
Robot
Control (management)
State space
Robot learning
Control engineering
Machine learning
Engineering
Mobile robot
Oceanography
Geology
Statistics
Mathematics
Economic growth
Economics
Operating system
Authors
Hai Huang,Tao Jiang,Zongyu Zhang,Yize Sun,Hongde Qin,Xinyang Li,Xu Yang
Identifier
DOI: 10.1016/j.jfranklin.2024.106773
Abstract
Autonomous manipulation represents highly intelligent coordination between robotic vision and control, and it is a hallmark of advancing robotic intelligence. The limitations of visual sensing and increasingly complex experimental conditions make autonomous manipulation difficult, particularly for deep reinforcement learning methods, which can enhance robotic control intelligence but require extensive training. Because underwater operations are characterized by a high-dimensional continuous state space and a continuous action space, this paper adopts policy-based reinforcement learning as its foundational approach. To address the instability and low convergence efficiency of traditional policy-based reinforcement learning algorithms, this paper proposes a novel policy learning method that adopts the Proximal Policy Optimization algorithm (PPO-Clip) and optimizes it through an actor-critic network, aiming to improve the stability and effectiveness of convergence during learning. In the underwater training environment, a new reward shaping scheme is designed to address reward sparsity: a manually crafted dense reward function serves as attractive and repulsive potential functions for goal manipulation and obstacle avoidance, respectively. For the highly complex underwater manipulation and training environment, a transfer learning algorithm is established to reduce the amount of training required and to compensate for the differences between simulation and experiment. Simulations and tank experiments verify the performance of the proposed policy learning method.
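The abstract names two standard ingredients: the PPO clipped surrogate objective and a dense reward built from attractive/repulsive potentials. The following is a minimal sketch of what these typically look like, assuming PyTorch/NumPy and illustrative gain values; the paper's actual network architecture, clipping range, and reward terms are not given here.

```python
import numpy as np
import torch


def ppo_clip_loss(ratio: torch.Tensor, advantage: torch.Tensor,
                  eps: float = 0.2) -> torch.Tensor:
    """Clipped surrogate objective of PPO-Clip.

    ratio:     pi_theta(a|s) / pi_theta_old(a|s) for each sampled step
    advantage: advantage estimates A_t from the critic
    eps:       clipping range; 0.2 is the common default (an assumption,
               not taken from the paper)
    """
    unclipped = ratio * advantage
    clipped = torch.clamp(ratio, 1.0 - eps, 1.0 + eps) * advantage
    # PPO maximizes the elementwise minimum; negate it to get a loss.
    return -torch.min(unclipped, clipped).mean()


def shaped_reward(pos: np.ndarray, goal: np.ndarray,
                  obstacles: list[np.ndarray],
                  k_att: float = 1.0, k_rep: float = 1.0,
                  d0: float = 0.5) -> float:
    """Dense reward from attractive/repulsive potential functions.

    The gains k_att, k_rep and the repulsion cutoff d0 are hypothetical;
    the paper hand-crafts its own dense reward for the underwater task.
    """
    # Attractive term: quadratic pull of the end-effector toward the goal.
    r = -k_att * float(np.linalg.norm(pos - goal)) ** 2
    # Repulsive term: penalize proximity to each obstacle inside radius d0.
    for obs in obstacles:
        d = float(np.linalg.norm(pos - obs))
        if d < d0:
            r -= k_rep * (1.0 / d - 1.0 / d0) ** 2
    return r
```

In a full training loop, the critic of the actor-critic network would supply `advantage`, and a dense reward of this kind would replace the sparse task reward so the agent receives a learning signal toward the goal, away from obstacles, at every step.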