亲爱的研友该休息了!由于当前在线用户较少,发布求助请尽量完整地填写文献信息,科研通机器人24小时在线,伴您度过漫漫科研夜!身体可是革命的本钱,早点休息,好梦!

A Novel Resource Management Framework for Blockchain-Based Federated Learning in IoT Networks

块链 计算机科学 物联网 计算机网络 万维网 计算机安全
作者
Aman Mishra,Yash Garg,Om Jee Pandey,Mahendra K. Shukla,Athanasios V. Vasilakos,Rajesh M. Hegde
出处
期刊:IEEE transactions on sustainable computing [Institute of Electrical and Electronics Engineers]
卷期号:9 (4): 648-660 被引量:1
标识
DOI:10.1109/tsusc.2024.3358915
摘要

At present, the centralized learning models, used for IoT applications generating large amount of data, face several challenges such as bandwidth scarcity, more energy consumption, increased uses of computing resources, poor connectivity, high computational complexity, reduced privacy, and large latency towards data transfer. In order to address the aforementioned challenges, Blockchain-Enabled Federated Learning Networks (BFLNs) emerged recently, which deal with trained model parameters only, rather than raw data. BFLNs provide enhanced security along with improved energy-efficiency and Quality-of-Service (QoS). However, BFLNs suffer with the challenges of exponential increased action space in deciding various parameter levels towards training and block generation. Motivated by aforementioned challenges of BFLNs, in this work, we are proposing an actor-critic Reinforcement Learning (RL) method to model the Machine Learning Model Owner (MLMO) in selecting the optimal set of parameter levels, addressing the challenges of exponential grow of action space in BFLNs. Further, due to the implicit entropy exploration, actor-critic RL method balances the exploration-exploitation trade-off and shows better performance than most off-policy methods, on large discrete action spaces. Therefore, in this work, considering the mobile scenario of the devices, MLMO decides the data and energy levels that the mobile devices use for the training and determine the block generation rate. This leads to minimized system latency and reduced overall cost, while achieving the target accuracy. Specifically, we have used Proximal Policy Optimization (PPO) as an on-policy actor-critic method with it's two variants, one based on Monte Carlo (MC) returns and another based on Generalized Advantage Estimate (GAE). We analyzed that PPO has better exploration and sample efficiency, lesser training time, and consistently higher cumulative rewards, when compared to off-policy Deep Q-Network (DQN).

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
斯文败类应助科研通管家采纳,获得10
3秒前
科研通AI2S应助科研通管家采纳,获得10
3秒前
星辰大海应助学无止境采纳,获得10
20秒前
29秒前
学无止境发布了新的文献求助10
32秒前
34秒前
38秒前
玛琳卡迪马完成签到,获得积分10
39秒前
潜行者完成签到 ,获得积分10
1分钟前
搜集达人应助Kevin Li采纳,获得30
1分钟前
呆萌的谷波完成签到,获得积分10
1分钟前
刘膝关节健康完成签到 ,获得积分10
1分钟前
狂野的含烟完成签到 ,获得积分10
1分钟前
1分钟前
隐形曼青应助lxy采纳,获得10
1分钟前
JamesPei应助宋曦光采纳,获得10
2分钟前
2分钟前
fhw完成签到 ,获得积分10
2分钟前
lxy发布了新的文献求助10
2分钟前
3分钟前
3分钟前
Kevin Li发布了新的文献求助30
3分钟前
kyt_vip发布了新的文献求助10
3分钟前
严冰蝶完成签到 ,获得积分10
3分钟前
3分钟前
斯文败类应助lxy采纳,获得10
3分钟前
宋曦光发布了新的文献求助10
3分钟前
3分钟前
lxy发布了新的文献求助10
3分钟前
123456完成签到 ,获得积分10
4分钟前
浮游应助科研通管家采纳,获得10
4分钟前
Kevin Li完成签到,获得积分10
4分钟前
liu完成签到,获得积分20
5分钟前
5分钟前
liu发布了新的文献求助10
5分钟前
白华苍松发布了新的文献求助10
5分钟前
dateline完成签到 ,获得积分10
5分钟前
bkagyin应助liu采纳,获得20
5分钟前
6分钟前
GenX发布了新的文献求助10
6分钟前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Applied Min-Max Approach to Missile Guidance and Control 5000
Metallurgy at high pressures and high temperatures 2000
Inorganic Chemistry Eighth Edition 1200
High Pressures-Temperatures Apparatus 1000
Free parameter models in liquid scintillation counting 1000
Standards for Molecular Testing for Red Cell, Platelet, and Neutrophil Antigens, 7th edition 1000
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6320486
求助须知:如何正确求助?哪些是违规求助? 8136645
关于积分的说明 17057428
捐赠科研通 5374395
什么是DOI,文献DOI怎么找? 2852876
邀请新用户注册赠送积分活动 1830588
关于科研通互助平台的介绍 1682090