强化学习
欠驱动
路径(计算)
计算机科学
钢筋
控制(管理)
无人机
机制(生物学)
控制理论(社会学)
人工智能
模拟
工程类
计算机网络
结构工程
认识论
哲学
海洋工程
作者
Yujiao Zhao,Yong Ma,Songlin Hu
出处
期刊:IEEE transactions on neural networks and learning systems
[Institute of Electrical and Electronics Engineers]
日期:2021-04-01
卷期号:32 (12): 5468-5478
被引量:103
标识
DOI:10.1109/tnnls.2021.3068762
摘要
This article addresses the problem of path following for underactuated unmanned surface vessels (USVs) formation via a modified deep reinforcement learning with random braking (DRLRB). A formation control model based on deep reinforcement learning (DRL) is constructed to urge USVs to form a preset formation. Specifically, an efficient reward function is designed from the perspective of velocity and error distance of each USV related to the given formation, and then a novel random braking mechanism is formulated to prevent the training of the decision-making network from falling into the local optimum and failing to achieve the training objectives. Following that, a virtual leader-based path-following guidance system is developed for the USV formation problem. Wherein, with the aid of DRLRB, our proposed system can adjust formation automatically and flexibly even when some USVs deviate from the formation. Simulation verifies the effectiveness and superiority of our formation and path-following control strategy.
科研通智能强力驱动
Strongly Powered by AbleSci AI