稳健主成分分析
杠杆(统计)
主成分分析
可扩展性
稀疏PCA
计算机科学
大数据
随机投影
算法
稳健性(进化)
稀疏矩阵
秩(图论)
组分(热力学)
数据挖掘
人工智能
数学
物理
化学
高斯分布
组合数学
基因
热力学
数据库
量子力学
生物化学
作者
N. Benjamin Erichson,Peng Zheng,Krithika Manohar,Steven L. Brunton,J. Nathan Kutz,Aleksandr Y. Aravkin
摘要
Sparse principal component analysis (SPCA) has emerged as a powerful technique for modern data analysis, providing improved interpretation of low-rank structures by identifying localized spatial structures in the data and disambiguating between distinct time scales. We demonstrate a robust and scalable SPCA algorithm by formulating it as a value-function optimization problem. This viewpoint leads to a flexible and computationally efficient algorithm. Further, we can leverage randomized methods from linear algebra to extend the approach to the large-scale (big data) setting. Our proposed innovation also allows for a robust SPCA formulation which obtains meaningful sparse principal components in spite of grossly corrupted input data. The proposed algorithms are demonstrated using both synthetic and real world data, and show exceptional computational efficiency and diagnostic performance.
科研通智能强力驱动
Strongly Powered by AbleSci AI