核(代数)
聚类分析
光谱聚类
字符串内核
核方法
数学
可分离空间
特征向量
算法
分布的核嵌入
单调函数
模式识别(心理学)
基质(化学分析)
计算机科学
人工智能
支持向量机
离散数学
数学分析
物理
量子力学
材料科学
复合材料
作者
Inderjit S. Dhillon,Yuqiang Guan,Brian Kulis
标识
DOI:10.1145/1014052.1014118
摘要
Kernel k-means and spectral clustering have both been used to identify clusters that are non-linearly separable in input space. Despite significant research, these methods have remained only loosely related. In this paper, we give an explicit theoretical connection between them. We show the generality of the weighted kernel k-means objective function, and derive the spectral clustering objective of normalized cut as a special case. Given a positive definite similarity matrix, our results lead to a novel weighted kernel k-means algorithm that monotonically decreases the normalized cut. This has important implications: a) eigenvector-based algorithms, which can be computationally prohibitive, are not essential for minimizing normalized cuts, b) various techniques, such as local search and acceleration schemes, may be used to improve the quality as well as speed of kernel k-means. Finally, we present results on several interesting data sets, including diametrical clustering of large gene-expression matrices and a handwriting recognition data set.
科研通智能强力驱动
Strongly Powered by AbleSci AI