超图
聚类分析
可扩展性
分拆(数论)
计算机科学
光谱聚类
理论计算机科学
图形
图划分
比例(比率)
相关聚类
数据挖掘
算法
数学
人工智能
离散数学
组合数学
数据库
物理
量子力学
作者
Yiyang Yang,Sucheng Deng,Juan Lu,Yuhong Li,Zhiguo Gong,Leong Hou U,Hao Zhang
标识
DOI:10.1016/j.ins.2020.07.018
摘要
Hypergraph is popularly used for describing multi-relationships among objects in a unified manner, and spectral clustering is regarded as one of the most effective algorithms for partitioning those objects (vertices) into different communities. However, the traditional spectral clustering for hypergraph (HC) incurs expensive costs in terms of both time and space. In this paper, we propose a framework called GraphLSHC to tackle the scalability problem faced by the large scale hypergraph spectral clustering. In our solution, the hypergraph used in GraphLSHC is expanded into a general format to capture complicated higher-order relationships. Moreover, GraphLSHC is capable to simultaneously partition both vertices and hyperedges according to the “eigen-trick”, which provides an approach for reducing the computational complexity of the clustering. To improve the performance further, several hyperedge-based sampling techniques are proposed, which can supplement the sampled matrix with the whole graph information. We also give a theoretical guarantee for the error boundary of the supplement. Several experiments show the superiority of the proposed framework over the state-of-the-art algorithms.
科研通智能强力驱动
Strongly Powered by AbleSci AI