质心
球(数学)
计算
有界函数
星团(航天器)
k-最近邻算法
算法
计算机科学
线段
数学
几何学
人工智能
数学分析
程序设计语言
作者
Shuyin Xia,Daowan Peng,Deyu Meng,Changqing Zhang,Guoyin Wang,Elisabeth Giem,Wei Wei,Zizhong Chen
标识
DOI:10.1109/tpami.2020.3008694
摘要
This paper presents a novel accelerated exact k-means called as "Ball k-means" by using the ball to describe each cluster, which focus on reducing the point-centroid distance computation. The "Ball k-means" can exactly find its neighbor clusters for each cluster, resulting distance computations only between a point and its neighbor clusters' centroids instead of all centroids. What's more, each cluster can be divided into "stable area" and "active area", and the latter one is further divided into some exact "annular area". The assignment of the points in the "stable area" is not changed while the points in each "annular area" will be adjusted within a few neighbor clusters. There are no upper or lower bounds in the whole process. Moreover, ball k-means uses ball clusters and neighbor searching along with multiple novel stratagems for reducing centroid distance computations. In comparison with the current state-of-the art accelerated exact bounded methods, the Yinyang algorithm and the Exponion algorithm, as well as other top-of-the-line tree-based and bounded methods, the ball k-means attains both higher performance and performs fewer distance calculations, especially for large-k problems. The faster speed, no extra parameters and simpler design of "Ball k-means" make it an all-around replacement of the naive k-means.
科研通智能强力驱动
Strongly Powered by AbleSci AI