强化学习
弹道
计算机科学
马尔可夫决策过程
避碰
碰撞
能源消耗
数据收集
马尔可夫过程
无线传感器网络
运动规划
实时计算
数学优化
人工智能
计算机网络
数学
工程类
机器人
天文
计算机安全
物理
电气工程
统计
作者
Na Su,Jun-Bo Wang,Cheng Zeng,Hua Zhang,Min Lin,Geoffrey Ye Li
出处
期刊:IEEE Internet of Things Journal
[Institute of Electrical and Electronics Engineers]
日期:2022-04-19
卷期号:9 (20): 19773-19786
被引量:16
标识
DOI:10.1109/jiot.2022.3168589
摘要
Employing unmanned surface vehicles (USVs) as marine data collectors is promising for large-scale environment sensing in remote ocean monitoring network. In this article, we consider a USV-aided marine data collection network, where a USV collects data from multiple monitoring terminals while avoiding collisions with monitoring terminals and obstacles. Aiming at minimizing energy consumption and data loss, we formulate a trajectory optimization problem with practical constraints, including collision avoidance, steering angle, and velocity limitation. The problem is intractable due to the stochastic arrived data and the random emergence and movement of dynamic obstacles. To efficiently solve it, we transform it as a constrained Markov decision process (MDP) problem and address it using a target-oriented double deep ${Q}$ -learning network (D2QN)-based collision avoidance and trajectory planning algorithm. In the proposed algorithm, the USV acts as an agent to explore and learn its trajectory planning policy by utilizing the causal knowledge. Numerical results demonstrate that the performance of the proposed algorithm is superior in terms of successful probability, energy consumption, and data loss.
科研通智能强力驱动
Strongly Powered by AbleSci AI