期刊:IEEE Sensors Journal [Institute of Electrical and Electronics Engineers] 日期:2022-11-28卷期号:23 (11): 11262-11273被引量:16
标识
DOI:10.1109/jsen.2022.3222575
摘要
To solve the problem of active collision avoidance for unmanned surface vehicles (USVs) in a complex maritime environment, a method of deep reinforcement learning (DRL) based on proximal policy optimization (PPO) is proposed. To master the collision avoidance policy by self-learning, the mathematical model of USV, dynamic obstacle generation model, and reward mechanism were established. Using the relative position between the obstacle and USV and the information of the closest point of approach (CPA), high-dimensional state features including the collision track layer and collision threat layer were constructed. On this basis, combined with low-dimensional states such as navigation state and path error, a deep convolutional neural network (CNN) structure fused in multifeature and multiscale was designed. The proposed DRL network was trained through repeated collision avoidance simulations in random environments. Simulation results reveal that the proposed algorithm can output effective decisions in complex scenarios and avoid dynamic obstacles quickly and safely.