计算机科学
聚类分析
差别隐私
背景(考古学)
联合学习
个性化
趋同(经济学)
数据挖掘
分布式计算
机器学习
万维网
古生物学
经济
生物
经济增长
作者
Guixun Luo,Naiyue Chen,Jiahuan He,Bingwei Jin,Zhiyuan Zhang,Yidong Li
标识
DOI:10.1016/j.future.2024.01.005
摘要
With the increasing number of intelligent devices joining into the Internet of Things (IoT), traditional centralized learning struggles to meet the performance requirements of terminal time-critical systems under heterogeneous data distribution. This challenge arises from the non-independent and non-identically distributed nature of data on terminal devices in real-world scenarios, which impacts the overall model convergence speed and terminal performance. As federated learning provides a privacy-preserving collaborative training framework, this paper focuses on the studying of the time response and performance issues in the context of data heterogeneity. In this paper, we propose a lightweight Randomized Response (RR) differential privacy method to protect the distribution characteristics of clients' data while quantifying their similarity. The paper introduces a community detection algorithm with linear time complexity to divide clients into clusters, which addresses inherent non-IID challenges in federated learning and meeting the rapid response requirements of time-critical systems. We conduct experiments on scenarios with different data distribution scenarios. The results show that the privacy-preserving mechanism has a negligible impact on model accuracy, and our algorithm demonstrates significant performance improvements in personalization compared to baseline methods. Additionally, in the presence of partially disconnected clients during training, compared to solo training, the pp-CFL algorithm enhances the timeliness and accuracy of the personalized local model.
科研通智能强力驱动
Strongly Powered by AbleSci AI