强化学习
计算机科学
马尔可夫决策过程
隐藏物
无线网络
分布式计算
GSM演进的增强数据速率
马尔可夫过程
计算机网络
马尔可夫链
钥匙(锁)
无线
趋同(经济学)
云计算
人工智能
机器学习
统计
电信
操作系统
经济增长
计算机安全
数学
经济
作者
Javane Rostampoor,Raviraj Adve,Ali Afana,Yahia Ahmed
标识
DOI:10.1109/tcomm.2023.3341856
摘要
Placing selected content at the edge of the network close to the users, known as caching, is an important technique to improve the efficiency of content delivery in wireless networks. In this paper, we consider caching in a cloud radio access network (C-RAN) in which the primary fronthaul link operates in the mmWave range and may switch to microwave frequencies in the case of blockage. We aim to minimize the average long-term network cost by optimizing dynamic fetching and caching decisions. Importantly, we consider the realistic case of user request distributions and blockage rates being a priori unknown and not necessarily stationary. We introduce change point detection (CPD) to detect significant changes in the environment; we couple this step with reinforcement learning (RL): our key contribution, the proposed change point detection assisted reinforcement learning (CPRL) algorithm learns the environment and (re-)optimizes the caching policy to solve the associated Markov decision process (MDP) problem. Essentially, CPD allows our learning algorithm to adapt its caching strategy to the new environment which shows faster convergence. The numerical results show that our proposed approach improves the efficiency of caching in wireless networks, making it more adaptable to changing request patterns over time.
科研通智能强力驱动
Strongly Powered by AbleSci AI