跳频扩频
干扰
异步通信
强化学习
计算机科学
扩频
频率分配
资源配置
计算机网络
增强学习
电信
物理
人工智能
频道(广播)
热力学
作者
Ning Rao,Hua Xu,Dan Wang,Zisen Qi,Yue Zhang,Wanyi Gu,Xiang Peng
出处
期刊:IEEE Sensors Journal
[Institute of Electrical and Electronics Engineers]
日期:2024-03-06
卷期号:24 (8): 13560-13577
被引量:1
标识
DOI:10.1109/jsen.2024.3369038
摘要
Jamming against frequency-hopping spread spectrum (FHSS) in wireless sensor networks (WSNs) has been primarily investigated with the follower jamming mode. However, implementing follower jamming in practical applications encounters manifold challenges, such as stringent requirements on hardware performance and difficulties in attaining accurate synchronization with signals. Diverging from existing works, in this article, we propose a novel partial-band noise jamming (PBNJ) decision-making algorithm based on asynchronous deep reinforcement learning (DRL), which can allocate central jamming frequency and bandwidth more efficiently in FHSS jamming. First, we model the problem of allocating jamming resource of PBNJ to disrupt the FHSS communication in WSNs as a Markov decision process (MDP). Next, considering the interrelationship among decisions made by different jamming nodes (JNs), we construct a multistep decision framework in a time-division manner, and the long short-term memory (LSTM) network is leveraged to fully extract decision features from historical data, capturing correlations between jamming strategies of the deployed JNs, and guides future jamming decisions and enhances collaboration among different JNs in jamming resources allocation. Furthermore, to accelerate the convergence, we adopt the asynchronous advantage actor–critic (A3C) algorithm to optimize the allocation of central jamming frequency and bandwidth of JNs, utilizing the architecture of multithreaded parallel training, and update the actor network and critic network in an asynchronous gradient descent manner. Simulation results show that the proposed LSTM-A3C algorithm converges fast and outperforms various baselines in terms of the convergence speed, jamming success rate, and the total reward.
科研通智能强力驱动
Strongly Powered by AbleSci AI