计算机科学
特征选择
非负矩阵分解
离群值
模式识别(心理学)
矩阵分解
人工智能
冗余(工程)
水准点(测量)
规范(哲学)
机器学习
特征(语言学)
数据挖掘
特征向量
哲学
法学
地理
大地测量学
物理
操作系统
量子力学
语言学
政治学
作者
Shixuan Zhou,Peng Song,Song Zhang,Liang Ji
标识
DOI:10.1016/j.eswa.2022.119468
摘要
Unsupervised feature selection (UFS) aims to select the most representative features from the original data, which can efficiently reduce the influence of redundancy, outliers and noises. Over the past decades, various UFS algorithms have been proposed. However, these methods often do not consider the necessity of sparsity or ignore the fuzziness of the data. To tackle these shortcomings, in this paper, a novel soft-label guided non-negative matrix factorization (SLNMF) method is proposed. Specifically, both the convex NMF and ℓ2,1−norm regularization are introduced to ensure the sparsity of the feature selection matrix. Furthermore, the soft-label matrix based on local distance is used to supervise the feature selection, and a linear regression is developed to find the correlation between the low-dimensional representation and the soft-label space. Finally, extensive experiments on several benchmark datasets are conducted. The results show that the proposed method is advanced over several state-of-the-art UFS methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI