计算机科学
强化学习
杠杆(统计)
切片
基站
分布式计算
计算机网络
资源管理(计算)
带宽(计算)
蜂窝网络
人工智能
万维网
作者
Rongpeng Li,Chujie Wang,Zhifeng Zhao,Rongbin Guo,Honggang Zhang
标识
DOI:10.1109/lcomm.2020.3001227
摘要
Network slicing aims to efficiently provision diversified services with distinct requirements over the same physical infrastructure. Therein, in order to efficiently allocate resources across slices, demand-aware inter-slice resource management is of significant importance. In this letter, we consider a scenario that contains several slices in a radio access network with base stations that share the same physical resources (e.g., bandwidth or slots). We primarily leverage advantage actor-critic (A2C), one typical deep reinforcement learning (DRL) algorithm, to solve this problem by considering the varying service demands as the environment state and the allocated resources as the environment action. However, given that the user mobility toughens the difficulty to perceive the environment, we further incorporate the long short-term memory (LSTM) into A2C, and put forward an LSTM-A2C algorithm to track the user mobility and improve the system utility. We verify the performance of the proposed LSTM-A2C through extensive simulations.
科研通智能强力驱动
Strongly Powered by AbleSci AI