运动规划
强化学习
计算机科学
灵活性(工程)
路径(计算)
搜救
运筹学
最短路径问题
趋同(经济学)
模式(计算机接口)
数学优化
人工智能
工程类
数学
图形
操作系统
经济
程序设计语言
统计
机器人
理论计算机科学
经济增长
作者
Bo Ai,Maoxin Jia,Hanwen Xu,Jin Xu,Zhen Wen,Benshuai Li,Dan Zhang
标识
DOI:10.1016/j.oceaneng.2021.110098
摘要
In maritime search and rescue (SAR), the planning of the search path will directly affect the efficiency of searching for people overboard in the search area. However, traditional SAR decision-making schemes often adopt a fixed search path planning mode, but the limits are poor flexibility, low efficiency, and insufficient intelligence. This paper plans a search path with the shortest time-consuming and priority coverage of high-probability areas, considering complete coverage of maritime SAR areas and avoiding maritime obstacles. Firstly, a maritime SAR environment model is built using marine environmental field data and electronic charts. Secondly, an autonomous coverage path planning model for maritime SAR is proposed based on reinforcement learning, in which a reward function with multiple constraints is designed to guide the navigation action of the vessel agent. In the iterative training process of the path planning model, the random action selection probability is dynamically adjusted by the nonlinear action selection policy to ensure the stable convergence of the model. Finally, the experimental verification is conducted in different small-scale maritime SAR simulation scenarios. The results indicate that the search path can cover the high-probability areas preferentially with lower repeated coverage and shorter path length compared with other path planning algorithms.
科研通智能强力驱动
Strongly Powered by AbleSci AI