Distributed and Collective Deep Reinforcement Learning for Computation Offloading: A Practical Perspective

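Keywords: Computer science; Reinforcement learning; Server; Distributed computing; Convergence (economics); Mobile edge computing; Artificial intelligence; Focus (optics); Resource allocation; Perspective (graphical); Enhanced Data Rates for GSM Evolution; Machine learning; Computer network; Physics; Optics; Economics; Economic growth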

Authors

Xiaoyu Qiu, Weikun Zhang, Wuhui Chen, Zibin Zheng

Source

Journal: IEEE Transactions on Parallel and Distributed Systems [Institute of Electrical and Electronics Engineers]
Date: 2020-12-10; Volume (issue): 32 (5), pages 1085-1101; Citations: 84

Identifier

DOI: 10.1109/tpds.2020.3042599

Abstract

Mobile edge computing (MEC) is a promising solution for supporting resource-constrained devices by offloading tasks to edge servers. However, traditional approaches to computation offloading (e.g., linear programming and game-theory methods) mainly focus on immediate performance, which can lead to performance degradation in the long run. Recent breakthroughs in deep reinforcement learning (DRL) provide alternative methods that focus on maximizing the cumulative reward. Nonetheless, a large gap remains in deploying real DRL applications in MEC, because: 1) training a well-performing DRL agent typically requires large quantities of highly diverse data, and 2) DRL training is usually accompanied by huge costs caused by trial and error. To address this gap, we study the application of DRL to the multi-user computation offloading problem from a more practical perspective. In particular, we propose a distributed and collective DRL algorithm called DC-DRL with several improvements: 1) a distributed and collective training scheme that assimilates knowledge from multiple MEC environments, which not only greatly increases the amount and diversity of data but also spreads the exploration costs, 2) an updating method called adaptive n-step learning, which can improve training efficiency without suffering from the high variance caused by distributed training, and 3) a combination of the advantages of deep neuroevolution and policy gradient to maximize the utilization of multiple environments and prevent premature convergence. Lastly, evaluation results demonstrate the effectiveness of our proposed algorithm. Compared with the baselines, the exploration costs and final system costs are reduced by at least 43 percent and 9.4 percent, respectively.
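The abstract names adaptive n-step learning but does not say how n is chosen, so the following is only a minimal sketch of the standard fixed n-step return that such methods build on, not the authors' update rule. The function name `n_step_return`, its parameters, and the default discount factor are assumptions made for illustration. In an offloading setting the rewards would typically be negative system costs, which is also an assumption here.

```python
from typing import Sequence


def n_step_return(
    rewards: Sequence[float],
    bootstrap_value: float,
    gamma: float = 0.99,
) -> float:
    """Standard n-step return (hypothetical helper, not from the paper).

    Computes r_0 + gamma * r_1 + ... + gamma^(n-1) * r_(n-1)
    + gamma^n * bootstrap_value, where n = len(rewards) and
    bootstrap_value is the critic's estimate for the state reached
    after the n-th step.
    """
    g = bootstrap_value
    # Fold the rewards in from the last step back to the first.
    for r in reversed(rewards):
        g = r + gamma * g
    return g


if __name__ == "__main__":
    # Two observed rewards, then bootstrap from an estimated value of 10.
    print(n_step_return([1.0, 1.0], bootstrap_value=10.0, gamma=0.5))
```

A larger n relies more on observed rewards and less on the value estimate, which usually speeds up learning at the price of higher variance. That trade-off is presumably what an adaptive choice of n is meant to manage.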
