稳健主成分分析
奇异值分解
计算机科学
可扩展性
矩阵分解
秩(图论)
主成分分析
稀疏矩阵
因式分解
稳健性(进化)
算法
计算复杂性理论
矩阵完成
人工智能
数学
组合数学
物理
特征向量
高斯分布
基因
数据库
化学
量子力学
生物化学
作者
Chong Peng,Yongyong Chen,Zhao Kang,Chenglizhao Chen,Qiang Cheng
标识
DOI:10.1016/j.ins.2019.09.074
摘要
Low-rankness has been widely observed in real world data and there is often a need to recover low-rank matrices in many machine learning and data mining problems. Robust principal component analysis (RPCA) has been used for such problems by separating the data into a low-rank and a sparse part. The convex approach to RPCA has been well studied due to its elegant properties in theory and many extensions have been developed. However, the state-of-the-art algorithms for the convex approach and their extensions are usually expensive in complexity due to the need for solving singular value decomposition (SVD) of large matrices. In this paper, we propose a novel RPCA model based on matrix tri-factorization, which only needs the computation of SVDs for very small matrices. Thus, this approach reduces the complexity of RPCA to be linear and makes it fully scalable. It also overcomes the drawback of the state-of-the-art scalable approach such as AltProj, which requires the precise knowledge of the true rank of the low-rank component. As a result, our method is about 4 times faster than AltProj. Our method can be used as a light-weight, scalable tool for RPCA in the absence of the precise value of the true rank.
科研通智能强力驱动
Strongly Powered by AbleSci AI