计算机科学
聚类分析
图形
人工智能
模式识别(心理学)
理论计算机科学
作者
Wei Xia,Qianqian Wang,Quanxue Gao,Ming Yang,Xinbo Gao
标识
DOI:10.1109/tmm.2022.3213208
摘要
Attributed graph clustering, which learns node representation from node attribute and topological graph for clustering, is a fundamental and challenging task for multimedia network-structured data analysis. Recently, graph contrastive learning (GCL)-based methods have obtained impressive clustering performance on this task. Nevertheless, there still remain some limitations to be solved: 1) most existing methods fail to consider the self-consistency between latent representations and cluster structures; and 2) most methods require a post-processing operation to get clustering labels. Such a two-step learning scheme results in models that cannot handle newly generated data, i.e ., out-of-sample (OOS) nodes. To address these issues in a unified framework, a S elf-consistent C ontrastive A ttributed G raph C lustering (SCAGC) network with pseudo-label prompt is proposed in this article. In SCAGC, by clustering labels prompt information, a self-consistent contrastive loss, which aims to maximize the consistencies of intra-cluster representations while minimizing the consistencies of inter-cluster representations, is designed for representation learning. Meanwhile, a clustering module is built to directly output clustering labels by contrasting the representation of different clusters. Thus, for the OOS nodes, SCAGC can directly calculate their clustering labels. Extensive experimental results on seven benchmark datasets have shown that SCAGC consistently outperforms 16 competitive clustering methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI