计算机科学
聚类分析
自编码
人工智能
邻接矩阵
模式识别(心理学)
特征学习
图形
邻接表
无监督学习
深度学习
高维数据聚类
理论计算机科学
算法
作者
Shunxin Xiao,Shiping Wang,Wenzhong Guo
出处
期刊:IEEE Transactions on Big Data
[Institute of Electrical and Electronics Engineers]
日期:2023-02-01
卷期号:9 (1): 254-266
被引量:9
标识
DOI:10.1109/tbdata.2022.3160477
摘要
Unsupervised clustering is a crucial issue in data mining and pattern recognition. Based on deep learning paradigms, deep clustering algorithms have been studied extensively and obtained superior performance in various applications. However, most of previous methods did not use helpful information from neighborhood relations to form group-separated space, and the feature embedding is usually distorted during the training process. To tackle the former limitation, we develop a graph convolution based unsupervised learning algorithm named Stacked Graph Autoencoder (SGAE). Specifically, SGAE utilizes the message passing mechanism to aggregate information from neighbors and obtain a meaningful and separated latent representation. Since the adjacency matrix is unavailable in clustering tasks, a graph construction approach with two pruning strategies is introduced to generate a transition matrix. To reduce the distortion caused by the multi-layered network training process, we further propose a topological structure preservation mechanism. It uses the constructed adjacency graph as supervised information, to maintain the relationship between nodes in the original space. Experiments on several popular benchmark datasets show that SGAE achieves significant improvements compared to unsupervised and semi-supervised deep clustering methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI