计算机科学
强化学习
静态路由
无线传感器网络
能源消耗
动态源路由
计算机网络
分布式计算
路由协议
布线(电子设计自动化)
无线路由协议
地理路由
链路状态路由协议
马尔可夫决策过程
基于策略的路由
马尔可夫过程
人工智能
工程类
统计
数学
电气工程
作者
Jing Ren,Jiangong Zheng,Xiaotong Guo,Tongyu Song,Xiong Wang,Sheng Wang,Wei Zhang
出处
期刊:IEEE Internet of Things Journal
[Institute of Electrical and Electronics Engineers]
日期:2023-06-27
卷期号:11 (1): 995-1011
被引量:3
标识
DOI:10.1109/jiot.2023.3289888
摘要
Wireless Sensor Networks (WSNs) enable intelligent collaborative perceptions in the Internet of Things. However, devices in WSNs are battery-powered with limited energy resources. During transmission, routing policies significantly affect the energy efficiency in terms of both energy consumption and energy balance among nodes, and further impact the network lifetime. Previous works mostly used heuristic fixed strategies to make routing decisions based on incomplete information in a distributed manner for lower control costs and faster calculation when facing numerous devices in WSNs, which easily lead to performance limitations and routing loops. To this end, we model the network lifetime maximization problem as a Decentralized Partially Observable Markov Decision Process and propose a new scheme MeFi based on Mean Field Reinforcement Learning to perform real-time energy-efficient routing policies for WSNs. The utilization of Mean Field Theory effectively simplifies the intractable interactions among numerous agents and guides the policy training. Additionally, a prioritized-sampling loop-free algorithm is developed to eliminate routing loops and avoid routing policies with significant energy consumption. Experimental results show that our scheme outperforms several algorithms by up to 50%, significantly enhancing energy efficiency and extending WSN lifetime under different circumstances.
科研通智能强力驱动
Strongly Powered by AbleSci AI