强化学习
计算机科学
避碰
弹道
运动规划
路径(计算)
实时计算
旅行商问题
轨迹优化
碰撞
分布式计算
数学优化
人工智能
计算机网络
算法
机器人
物理
计算机安全
数学
天文
作者
Yu-Hsin Hsu,Rung-Hung Gau
标识
DOI:10.1109/tmc.2020.3003639
摘要
In this paper, we propose a reinforcement learning approach of collision avoidance and investigate optimal trajectory planning for unmanned aerial vehicle (UAV) communication networks. Specifically, each UAV takes charge of delivering objects in the forward path and collecting data from heterogeneous ground IoT devices in the backward path. We adopt reinforcement learning for assisting UAVs to learn collision avoidance without knowing the trajectories of other UAVs in advance. In addition, for each UAV, we use optimization theory to find out a shortest backward path that assures data collection from all associated IoT devices. To obtain an optimal visiting order for IoT devices, we formulate and solve a no-return traveling salesman problem. Given a visiting order, we formulate and solve a sequence of convex optimization problems to obtain line segments of an optimal backward path for heterogeneous ground IoT devices. We use analytical results and simulation results to justify the usage of the proposed approach. Simulation results show that the proposed approach is superior to a number of alternative approaches.
科研通智能强力驱动
Strongly Powered by AbleSci AI