统计的
聚类分析
统计
集合(抽象数据类型)
星团(航天器)
完备性(序理论)
数据集
数学
层次聚类
计算机科学
数据挖掘
算法
数学分析
程序设计语言
作者
Robert Tibshirani,Guenther Walther,Trevor Hastie
标识
DOI:10.1111/1467-9868.00293
摘要
Summary We propose a method (the ‘gap statistic’) for estimating the number of clusters (groups) in a set of data. The technique uses the output of any clustering algorithm (e.g. K-means or hierarchical), comparing the change in within-cluster dispersion with that expected under an appropriate reference null distribution. Some theory is developed for the proposal and a simulation study shows that the gap statistic usually outperforms other methods that have been proposed in the literature.
科研通智能强力驱动
Strongly Powered by AbleSci AI