聚类分析
计算机科学
人工智能
样品(材料)
模式识别(心理学)
培训(气象学)
地理
色谱法
化学
气象学
作者
Hu Lu,Chao Chen,Hui Wei,Zhenghai Ma,Ke Jiang,Yingquan Wang
标识
DOI:10.1016/j.patcog.2022.108611
摘要
• This paper proposes an improved deep convolutional embedded clustering algorithm using reliable samples. • This paper designs the new deep clustering model structure and corresponding loss function. • In this study, we select reliable samples with pseudo-labels and pass them to the convolutional neural network for training to get a better clustering model. • We conducted experimental tests on four standard data sets and show the better performance compared to the state-of-the-art clustering algorithms. The deep clustering algorithm can learn the latent features of the embedded subspace, and further realize the clustering of samples in the feature space. The existing deep clustering algorithms mostly integrate neural networks and traditional clustering algorithms. However, for sample sets with many noise points, the effect of the clustering remains unsatisfactory. To address this issue, we propose an improved deep convolutional embedded clustering algorithm using reliable samples (IDCEC) in this paper. The algorithm first uses the convolutional autoencoder to extract features and cluster the samples. Then we select reliable samples with pseudo-labels and pass them to the convolutional neural network for training to get a better clustering model. We construct a new loss function for backpropagation training and implement an unsupervised deep clustering method. To verify the performance of the method proposed in this paper, we conducted experimental tests on standard data sets such as MNIST and USPS. Experimental results show that our method has better performance compared to traditional clustering algorithms and the state-of-the-art deep clustering algorithm under four clustering metrics.
科研通智能强力驱动
Strongly Powered by AbleSci AI