数学
一般化
人口
非参数统计
独立性(概率论)
多元统计
集合(抽象数据类型)
分类
样品(材料)
分拆(数论)
功能(生物学)
算法
统计
组合数学
计算机科学
算术
数学分析
社会学
人口学
生物
进化生物学
化学
色谱法
程序设计语言
出处
期刊:Ludwig Maximilian University of Munich - Munich Personal RePEc Archive
日期:1967-01-01
卷期号:1: 281-297
被引量:22788
摘要
This paper describes a number of applications of the 'k-means', a procedure for classifying a random sample of points in E sub N. The procedure consists of starting with k groups which each consist of a single random point, and thereafter adding the points one after another to the group whose mean each point is nearest. After a point is added to a group, the mean of that group is adjusted so as to take account of the new point. Thus at each stage there are in fact k means, one for each group. After the sample is processed in this way, the points are classified on the basis of nearness to the final means. The portions which result tend to be fficient in the sense of having low within class variance. Applications are suggested for the problems of non-linear prediction, efficient communication, non-parametric tests of independence, similarity grouping, and automatic file construction. The extension of the methods to general metric spaces is indicated. (Author)
科研通智能强力驱动
Strongly Powered by AbleSci AI