自编码
聚类分析
计算机科学
人工智能
模式识别(心理学)
特征(语言学)
特征选择
特征学习
选择(遗传算法)
高维数据聚类
数据挖掘
相关聚类
深度学习
哲学
语言学
作者
Woojin Doo,Heeyoung Kim
出处
期刊:IEEE Transactions on Knowledge and Data Engineering
[Institute of Electrical and Electronics Engineers]
日期:2023-01-01
卷期号:: 1-17
被引量:2
标识
DOI:10.1109/tkde.2023.3323580
摘要
Existing deep learning methods for clustering high-dimensional data perform feature selection and clustering separately, which can result in the exclusion of some important features for clustering. In this paper, we propose a method that performs deep clustering and feature selection simultaneously by inserting a concrete selector layer between the input layer and the first encoder layer of a modified autoencoder. The concrete selector layer performs feature selection, while the modified autoencoder performs clustering in the latent space by incorporating K-means loss and inter-cluster distances. The proposed method, called the K-concrete autoencoder, selects features important for clustering and uses only the selected features to learn K-means-friendly latent representations of the data. Moreover, we propose an extension of the K-concrete autoencoder to provide relative importance of each selected feature. We demonstrate the effectiveness of the proposed method using simulated and real datasets.
科研通智能强力驱动
Strongly Powered by AbleSci AI