Game Combined Multi-Agent Reinforcement Learning Approach for UAV Assisted Offloading

计算机科学强化学习避障潜在博弈分布式计算架空（工程）云计算高效能源利用避碰服务器趋同（经济学）移动机器人实时计算纳什均衡机器人数学优化计算机网络人工智能工程类碰撞操作系统经济电气工程经济增长计算机安全数学

作者

Ang Gao,Qi Wang,Wei Liang,Zhiguo Ding

出处

期刊：IEEE Transactions on Vehicular Technology [Institute of Electrical and Electronics Engineers]
日期：2021-12-01 卷期号：70 (12): 12888-12901 被引量：45

标识

DOI：10.1109/tvt.2021.3121281

摘要

Air ground integrated mobile cloud computing (MCC) provides unmanned aerial vehicles (UAVs) the capability to act as an aerial relay with more flexibility and resilience. In the cloud computing architecture, the data generated by ground users (GUs) can be offloaded to the remote server for fast processing. However, the heterogeneity of mobile tasks makes the data size distributed among GUs unbalanced. Besides, the energy efficiency of UAVs movement should be carefully considered for sustainable flight and obstacle avoidance. In general, such a joint trajectory issue can hardly be formulated as a convex optimization in unpredictable and dynamic environments. This paper proposes a potential game combined multi-agent deep deterministic policy gradient (MADDPG) approach to optimize multiple UAVs' trajectory with the consideration of GUs' offloading delay, energy efficiency as well as obstacle avoidance system. In specific, we first model the issue as a mixed integer non-linear problem (MINP), in which the service assignment between multi-user and multi-UAV is solved by potential game. The convergence to a Nash Equilibrium (NE) can be achieved by distributive service assignment update with infinite iteration. Then, we optimize the trajectory with obstacle avoidance at each UAV by MADDPG approach, which has a great advantage of centralized-training and decentralized-execution to reduce the global synchronized communication overhead. UAVs movement can be optimized in continuity rather than other deep reinforcement learning (DRL) approaches generating discrete simple actions. Experiments demonstrate the proposed game-combined learning algorithm can minimize the offloading delay, enhance UAVs’ energy efficiency and avoid the obstacles at the same time.

求助该文献

Game Combined Multi-Agent Reinforcement Learning Approach for UAV Assisted Offloading

今日热心研友