计算机科学
马尔可夫决策过程
运动规划
避碰
弹道
频道(广播)
实时计算
任务(项目管理)
路径(计算)
参数化复杂度
马尔可夫过程
碰撞
数据收集
过程(计算)
分布式计算
计算机网络
人工智能
算法
机器人
计算机安全
物理
操作系统
统计
经济
管理
数学
天文
作者
Xueyuan Wang,M. Cenk Gursoy,Tugba Erpek,Yalin E. Sagduyu
出处
期刊:IEEE Internet of Things Journal
[Institute of Electrical and Electronics Engineers]
日期:2022-09-01
卷期号:9 (17): 16663-16676
被引量:39
标识
DOI:10.1109/jiot.2022.3153585
摘要
Unmanned aerial vehicles (UAVs) are expected to be an integral part of wireless networks, and determining collision-free trajectory in multi-UAV non-cooperative scenarios while collecting data from distributed Internet of Things (IoT) nodes is a challenging task. In this paper, we consider a path planning optimization problem to maximize the collected data from multiple IoT nodes under realistic constraints. The considered multi-UAV non-cooperative scenarios involve random number of other UAVs in addition to the typical UAV, and UAVs do not communicate or share information among each other. We translate the problem into a Markov decision process (MDP) with parameterized states, permissible actions, and detailed reward functions. Dueling double deep Q-network (D3QN) is proposed to learn the decision making policy for the typical UAV, without any prior knowledge of the environment (e.g., channel propagation model and locations of the obstacles) and other UAVs (e.g., their missions, movements, and policies). The proposed algorithm can adapt to various missions in various scenarios, e.g., different numbers and positions of IoT nodes, different amount of data to be collected, and different numbers and positions of other UAVs. Numerical results demonstrate that real-time navigation can be efficiently performed with high success rate, high data collection rate, and low collision rate.
科研通智能强力驱动
Strongly Powered by AbleSci AI