Authors
Youming Tao, Sijia Cui, Wenlu Xu, Haofei Yin, Dongxiao Yu, Weifa Liang, Xiuzhen Cheng
Identifiers
DOI: 10.1109/TC.2023.3257510
Abstract
Both Byzantine resilience and communication efficiency have recently attracted tremendous attention for their significance in edge federated learning. However, most existing algorithms may fail when dealing with real-world irregular data that behaves in a heavy-tailed manner. To address this issue, we study stochastic convex and non-convex optimization problems for federated learning at the edge and show how to handle heavy-tailed data while simultaneously retaining Byzantine resilience, communication efficiency, and optimal statistical error rates. Specifically, we first present a Byzantine-resilient distributed gradient descent algorithm that handles heavy-tailed data while still converging under standard assumptions. To reduce the communication overhead, we further propose a second algorithm that incorporates gradient compression techniques to save communication costs during the learning process. Theoretical analysis shows that our algorithms achieve order-optimal statistical error rates in the presence of Byzantine devices. Finally, we conduct extensive experiments on both synthetic and real-world datasets to verify the efficacy of our algorithms.
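The abstract does not specify the paper's concrete estimators, so the following is a minimal sketch of the general recipe it describes: distributed gradient descent in which a server aggregates worker gradients with a Byzantine-resilient statistic, plus optional gradient compression to cut communication. Here the aggregator is a coordinate-wise median and the compressor is top-k sparsification, both standard stand-ins; the names `top_k_compress`, `robust_aggregate`, and `train` are illustrative and not from the paper.

```python
# Sketch: Byzantine-resilient distributed gradient descent with optional
# top-k gradient compression. Assumptions (not from the paper): the server
# aggregates with a coordinate-wise median, and compression is plain top-k.
import numpy as np

def top_k_compress(g: np.ndarray, k: int) -> np.ndarray:
    """Keep only the k largest-magnitude coordinates; zero the rest."""
    out = np.zeros_like(g)
    idx = np.argpartition(np.abs(g), -k)[-k:]
    out[idx] = g[idx]
    return out

def robust_aggregate(grads: np.ndarray) -> np.ndarray:
    """Coordinate-wise median over worker gradients (rows = workers).
    Tolerates a minority of arbitrarily corrupted (Byzantine) rows."""
    return np.median(grads, axis=0)

def train(workers, w0, lr=0.1, steps=100, k=None):
    """workers: callables mapping parameters -> local gradient estimate.
    Byzantine workers may return arbitrary vectors. Pass k to enable
    top-k compression of each uploaded gradient."""
    w = w0.copy()
    for _ in range(steps):
        grads = np.stack([wk(w) for wk in workers])
        if k is not None:  # optional communication saving (biased compressor)
            grads = np.stack([top_k_compress(g, k) for g in grads])
        w -= lr * robust_aggregate(grads)
    return w

# Toy usage: 5 honest workers on least squares with heavy-tailed (Pareto)
# noise, plus 1 Byzantine device that always sends an adversarial vector.
rng = np.random.default_rng(0)
d = 10
w_star = rng.normal(size=d)

def make_honest():
    X = rng.normal(size=(50, d))
    y = X @ w_star + rng.pareto(2.1, size=50)  # heavy-tailed noise
    return lambda w: X.T @ (X @ w - y) / len(y)

workers = [make_honest() for _ in range(5)]
workers.append(lambda w: 1e3 * np.ones(d))  # Byzantine device

w_hat = train(workers, w0=np.zeros(d), lr=0.05, steps=200)
```

Note that the top-k compressor is biased, so practical schemes usually pair it with error feedback; the paper's compression technique and its robust estimator for heavy-tailed gradients may differ from this sketch.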