计算机科学
聚类分析
子空间拓扑
光谱聚类
约束聚类
可扩展性
相关聚类
线性子空间
判别式
理论计算机科学
分拆(数论)
时间复杂性
高维数据聚类
图形
人工智能
CURE数据聚类算法
数据挖掘
算法
数学
组合数学
几何学
数据库
作者
Man-Sheng Chen,Chang‐Dong Wang,Dong Huang,Jianhuang Lai,Philip S. Yu
标识
DOI:10.1145/3534678.3539282
摘要
Multi-view subspace clustering targets at clustering data lying in a union of low-dimensional subspaces. Generally, an n X n affinity graph is constructed, on which spectral clustering is then performed to achieve the final clustering. Both graph construction and graph partitioning of spectral clustering suffer from quadratic or even cubic time and space complexity, leading to difficulty in clustering large-scale datasets. Some efforts have recently been made to capture data distribution in multiple views by selecting key anchor bases beforehand with k-means or uniform sampling strategy. Nevertheless, few of them pay attention to the algebraic property of the anchors. How to learn a set of high-quality orthogonal bases in a unified framework, while maintaining its scalability for very large datasets, remains a big challenge. In view of this, we propose an Efficient Orthogonal Multi-view Subspace Clustering (OMSC) model with almost linear complexity. Specifically, the anchor learning, graph construction and partition are jointly modeled in a unified framework. With the mutual enhancement of each other, a more discriminative and flexible anchor representation and cluster indicator can be jointly obtained. An alternate minimizing strategy is developed to deal with the optimization problem, which is proved to have linear time complexity w.r.t. the sample number. Extensive experiments have been conducted to confirm the superiority of the proposed OMSC method. The source codes and data are available at https://github.com/ManshengChen/Code-for-OMSC-master.
科研通智能强力驱动
Strongly Powered by AbleSci AI