认知无线电
计算机科学
强化学习
频谱管理
计算机网络
认知网络
软件部署
趋同(经济学)
互联网
分布式计算
电信
人工智能
无线
万维网
经济
经济增长
操作系统
作者
Xiaohui Zhang,Ze Chen,Yinghui Zhang,Yang Liu,Minglu Jin,Tianshuang Qiu
出处
期刊:IEEE Internet of Things Journal
[Institute of Electrical and Electronics Engineers]
日期:2024-01-29
卷期号:11 (10): 17495-17509
被引量:3
标识
DOI:10.1109/jiot.2024.3359277
摘要
Integrating cognitive radio into Internet of Things (IoT) is conducive to reducing spectrum scarcity for large-scale IoT deployment, where a core technology is the design of spectrum access algorithms for effective assignment of spectrum holes. However, due to the partially observable channels and increased number of users in the cognitive radio Internet of Things (CRIoT) network, the secondary users have difficulty avoiding interferences and accessing the spectrum quickly. This study presents a distributed dynamic spectrum access (DSA) algorithm that employs a priority experience replay deep echo state Q-network (PER-DESQN) for CRIoT networks with multiple users and channels. To accelerate the Q-network convergence, we use an echo state network based on the underlying temporal correlation to estimate Q-values. Then, to resolve the Q-value overestimation and improve prediction accuracy, the estimated Q-value and decision action process are trained using a double deep Q-network (DDQN). Moreover, a priority experience replay mechanism that uses the Sum-Tree combined with importance sampling weights is proposed to optimize the DDQN to address the instability of the Q-value resulting from random sampling. As the simulation results demonstrate, the proposed algorithm can make fast and accurate DSA decisions and boost the network channel capacity significantly.
科研通智能强力驱动
Strongly Powered by AbleSci AI