计算机科学
强化学习
弹道
轨迹优化
无线传感器网络
无线
运动规划
实时计算
人工智能
机器学习
最优控制
分布式计算
数学优化
计算机网络
机器人
电信
物理
天文
数学
作者
Mincheol Seong,Ohyun Jo,Kyungseop Shin
标识
DOI:10.1016/j.engappai.2023.105891
摘要
A wireless sensor network assisted by multiple autonomous unmanned aerial vehicles (UAVs) is a promising solution for harvesting data and monitoring the circumstance in various applications. However, the complicated path planning problem of each UAV is still problematic. In this paper, we propose an optimal operation strategy based on multi-agent reinforcement learning (MARL) to tackle these hurdles. Various parameters such as the number of deployed UAVs, charging start capacity, and charging complete capacity define a multi-UAV system. This approach is applicable without a time-consuming and costly policy control. We also describe how to balance multiple objectives, such as data harvesting, charging, and collision avoidance, using transfer learning. Finally, learning a policy control that generalizes multiple scenario parameters allows us to analyze the performance of individual parameters in a specific scenario, which helps find the macro-level optimal parameter within a particular scenario. Videos are available at https://github.com/mincheolseong/UAV-Trajectory-Optimizer.
科研通智能强力驱动
Strongly Powered by AbleSci AI