主成分分析
计算机科学
离群值
规范化(社会学)
人工智能
模式识别(心理学)
缩小
预处理器
数据预处理
降维
稳健主成分分析
噪音(视频)
数据挖掘
降噪
机器学习
社会学
人类学
图像(数学)
程序设计语言
作者
Yunlong Gao,Yuzhe Feng,Youwei Xie,Jinyan Pan,Feiping Nie
出处
期刊:IEEE Transactions on Knowledge and Data Engineering
[Institute of Electrical and Electronics Engineers]
日期:2023-10-18
卷期号:: 1-13
被引量:2
标识
DOI:10.1109/tkde.2023.3325462
摘要
Principal component analysis (PCA) is one of the most versatile techniques for unsupervised dimension reduction, which is implemented as a fundamental preprocessing method in multiple tasks of statistics and machine learning research because of its efficiency. Nevertheless, researchers have concentrated on the identification of outliers that do not conform to the low-dimensional approximation through statistical methods, e.g., outlier rejection, without giving insights on each data point with a dynamic ratio of signal-to-noise components in the high-dimensional regimes. To characterize the dynamic nature of the principal component information, we propose a Normalized Robust PCA with Adaptive Reconstruction Error minimization model, which considers both the adaptive normalization technique and flexible weights learning simultaneously. With this configuration, the principal component information constantly adjusts the degree of sparsity for activated samples. In other words, the signal component's discrimination and noise information restriction could work cooperatively. Empirical studies on one synthetic dataset and several benchmarks demonstrate the effectiveness of our proposed method over existing outlier rejection methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI