Lyapunov优化
计算机科学
移动边缘计算
计算卸载
帧(网络)
在线算法
强化学习
数学优化
边缘计算
最优化问题
随机优化
李雅普诺夫函数
无线网络
随机规划
计算
GSM演进的增强数据速率
无线
算法
人工智能
计算机网络
李雅普诺夫方程
李雅普诺夫指数
数学
混乱的
非线性系统
电信
物理
量子力学
作者
Suzhi Bi,Liang Huang,Hui Wang,Ying Jun Zhang
标识
DOI:10.1109/twc.2021.3085319
摘要
Opportunistic computation offloading is an effective method to improve the computation performance of mobile-edge computing (MEC) networks under dynamic edge environment. In this paper, we consider a multi-user MEC network with time-varying wireless channels and stochastic user task data arrivals in sequential time frames. In particular, we aim to design an online computation offloading algorithm to maximize the network data processing capability subject to the long-term data queue stability and average power constraints. The online algorithm is practical in the sense that the decisions for each time frame are made without the assumption of knowing the future realizations of random channel conditions and data arrivals. We formulate the problem as a multi-stage stochastic mixed integer non-linear programming (MINLP) problem that jointly determines the binary offloading (each user computes the task either locally or at the edge server) and system resource allocation decisions in sequential time frames. To address the coupling in the decisions of different time frames, we propose a novel framework, named LyDROO, that combines the advantages of Lyapunov optimization and deep reinforcement learning (DRL). Specifically, LyDROO first applies Lyapunov optimization to decouple the multi-stage stochastic MINLP into deterministic per-frame MINLP subproblems. By doing so, it guarantees to satisfy all the long-term constraints by solving the per-frame subproblems that are much smaller in size. Then, LyDROO integrates model-based optimization and model-free DRL to solve the per-frame MINLP problems with very low computational complexity. Simulation results show that under various network setups, the proposed LyDROO achieves optimal computation performance while stabilizing all queues in the system. Besides, it induces very low computation time that is particularly suitable for real-time implementation in fast fading environments.
科研通智能强力驱动
Strongly Powered by AbleSci AI