计算机科学
水准点(测量)
推论
统计推断
计算统计学
数据科学
分类
大数据
实施
软件
数据挖掘
封面(代数)
机器学习
人工智能
软件工程
工程类
程序设计语言
地理
统计
机械工程
数学
大地测量学
作者
Ling Zhou,Ziyang Gong,Xiang Pan
出处
期刊:Annual review of statistics and its application
[Annual Reviews]
日期:2023-11-17
卷期号:11 (1)
标识
DOI:10.1146/annurev-statistics-040522-021241
摘要
Data are distributed across different sites due to computing facility limitations or data privacy considerations. Conventional centralized methods—those in which all datasets are stored and processed in a central computing facility—are not applicable in practice. Therefore, it has become necessary to develop distributed learning approaches that have good inference or predictive accuracy while remaining free of individual data or obeying policies and regulations to protect privacy. In this article, we introduce the basic idea of distributed learning and conduct a selected review on various distributed learning methods, which are categorized by their statistical accuracy, computational efficiency, heterogeneity, and privacy. This categorization can help evaluate newly proposed methods from different aspects. Moreover, we provide up-to-date descriptions of the existing theoretical results that cover statistical equivalency and computational efficiency under different statistical learning frameworks. Finally, we provide existing software implementations and benchmark datasets, and we discuss future research opportunities. Expected final online publication date for the Annual Review of Statistics and Its Application, Volume 11 is March 2024. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.
科研通智能强力驱动
Strongly Powered by AbleSci AI