计算机科学
去相关
外稃(植物学)
算法
分而治之算法
数学证明
可扩展性
加性模型
估计员
组分(热力学)
广义加性模型
特征选择
推论
航程(航空)
特征(语言学)
钥匙(锁)
数据挖掘
理论计算机科学
数学
人工智能
机器学习
统计
数据库
材料科学
几何学
生态学
复合材料
哲学
物理
热力学
生物
禾本科
语言学
计算机安全
作者
Yifan He,Ruiyang Wu,Yong Zhou,Yang Feng
标识
DOI:10.1080/01621459.2023.2225743
摘要
AbstractAbstract–Distributed statistical learning has become a popular technique for large-scale data analysis. Most existing work in this area focuses on dividing the observations, but we propose a new algorithm, DDAC-SpAM, which divides the features under a high-dimensional sparse additive model. Our approach involves three steps: divide, decorrelate, and conquer. The decorrelation operation enables each local estimator to recover the sparsity pattern for each additive component without imposing strict constraints on the correlation structure among variables. The effectiveness and efficiency of the proposed algorithm are demonstrated through theoretical analysis and empirical results on both synthetic and real data. The theoretical results include both the consistent sparsity pattern recovery as well as statistical inference for each additive functional component. Our approach provides a practical solution for fitting sparse additive models, with promising applications in a wide range of domains. Supplementary materials for this article are available online.KEYWORDS: Additive modelConsistencyDecorrelate and conquerDivideFeature space partitionVariable selection Supplementary MaterialsThe supplementary material consists of Lemma S.1–S.6 and the proofs of all lemmas, theorems, and corollaries.AcknowledgmentsWe thank the editor, the AE, and anonymous reviewers for their insightful comments which have greatly improved the scope and quality of the article.Disclosure StatementThe authors report there are no competing interests to declare.Additional informationFundingZhou was supported by the State Key Program of National Natural Science Foundation of China (71931004) and National Natural Science Foundation of China (92046005) and the National Key R&D Program of China (2021YFA1000100, 2021YFA1000101). Feng was supported by NIH grant 1R21AG074205-01, NYU University Research Challenge Fund, a grant from NYU School of Global Public Health, and through the NYU IT High Performance Computing resources, services, and staff expertise.
科研通智能强力驱动
Strongly Powered by AbleSci AI