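Reinforcement learning
NOMA
Computer science
Enhanced Data Rates for GSM Evolution
Power (physics)
Multiple-input multiple-output
Edge computing
Distributed computing
Reinforcement
Artificial intelligence
Computer network
Psychology
Telecommunications link
Social psychology
Channel (broadcasting)
Physics
Quantum mechanics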
Authors
Hongbiao Zhu, Qiong Wu, Xiao‐Jun Wu, Qiang Fan, Pingyi Fan, Jiangzhou Wang
Source
Journal: IEEE Internet of Things Journal
[Institute of Electrical and Electronics Engineers]
Date: 2021-12-27
Volume (Issue): 9 (14): 12770-12782
Citations: 68
Identifier
DOI:10.1109/jiot.2021.3138434
Abstract
Vehicular edge computing (VEC) is envisioned as a promising approach to processing the rapidly growing computation tasks of vehicular users (VUs). In a VEC system, each VU allocates power to process part of its tasks through offloading and the remainder through local execution. During offloading, each VU uses a multi-input multi-output non-orthogonal multiple access (MIMO-NOMA) channel to improve spectrum efficiency and capacity. However, the channel condition is uncertain because of the interference among VUs in the MIMO-NOMA channel and the time-varying path loss caused by each VU's mobility. In addition, the task arrival of each VU is stochastic in the real world. The stochastic task arrival and uncertain channel condition greatly affect the power consumption and task latency of each VU. It is therefore critical to design a power allocation scheme that accounts for the stochastic task arrival and channel variation and optimizes the long-term reward, which includes the power consumption and latency, in MIMO-NOMA VEC. Unlike traditional centralized deep reinforcement learning (DRL)-based schemes, this article constructs a decentralized DRL framework to formulate the power allocation optimization problem, in which local observations are selected as the state. The deep deterministic policy gradient (DDPG) algorithm is adopted to learn the optimal power allocation scheme based on the decentralized DRL framework. Simulation results demonstrate that the proposed power allocation scheme outperforms existing schemes.
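The abstract does not give the system model, so the following is only an illustrative sketch of how one VU's per-slot reward could trade off power against latency. Every modelling choice here is an assumption rather than the article's formulation: the Shannon-rate offloading model, the linear local-processing model, the use of the task backlog as a latency proxy, and all names and parameters (`offload_bits`, `local_bits`, `step`, `bits_per_joule`, `w_power`, `w_delay`).

```python
import math


def offload_bits(power_w, channel_gain, interference_plus_noise_w,
                 bandwidth_hz, slot_s):
    """Bits offloaded in one slot under a Shannon-rate model (assumption)."""
    sinr = power_w * channel_gain / interference_plus_noise_w
    return bandwidth_hz * math.log2(1.0 + sinr) * slot_s


def local_bits(power_w, bits_per_joule, slot_s):
    """Bits processed locally in one slot, linear in energy (toy assumption)."""
    return power_w * slot_s * bits_per_joule


def step(buffer_bits, arrival_bits, p_local, p_offload, *,
         channel_gain, interference_plus_noise_w,
         bandwidth_hz=1e6, slot_s=0.01, bits_per_joule=1e6,
         w_power=1.0, w_delay=1e-4):
    """Advance one VU by one slot; return (next_buffer_bits, reward).

    The backlog stands in for latency: a longer queue means tasks wait
    longer. The reward is the negative weighted sum of power and backlog.
    """
    served = (local_bits(p_local, bits_per_joule, slot_s)
              + offload_bits(p_offload, channel_gain,
                             interference_plus_noise_w, bandwidth_hz, slot_s))
    # Serve the existing backlog first; new arrivals join the queue afterwards.
    next_buffer = max(buffer_bits - served, 0.0) + arrival_bits
    reward = -(w_power * (p_local + p_offload) + w_delay * next_buffer)
    return next_buffer, reward
```

In a decentralized setup of the kind the abstract describes, each VU's agent would see only its own quantities (for example its backlog and its own channel measurements) and output the pair `(p_local, p_offload)` as a continuous action, which is the kind of action space DDPG is designed for.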