计算机科学
聚类分析
特征选择
度量(数据仓库)
公制(单位)
兰德指数
可用性
人工智能
数据挖掘
选择(遗传算法)
特征(语言学)
关系(数据库)
索引(排版)
班级(哲学)
星团(航天器)
模式识别(心理学)
机器学习
运营管理
万维网
语言学
哲学
人机交互
经济
程序设计语言
作者
Jorge M. Santos,Mark J. Embrechts
标识
DOI:10.1007/978-3-642-04277-5_18
摘要
The Adjusted Rand Index (ARI) is frequently used in cluster validation since it is a measure of agreement between two partitions: one given by the clustering process and the other defined by external criteria. In this paper we investigate the usability of this clustering validation measure in supervised classification problems by two different approaches: as a performance measure and in feature selection. Since ARI measures the relation between pairs of dataset elements not using information from classes (labels) it can be used to detect problems with the classification algorithm specially when combined with conventional performance measures. Instead, if we use the class information, we can apply ARI also to perform feature selection. We present the results of several experiments where we have applied ARI both as a performance measure and for feature selection showing the validity of this index for the given tasks.
科研通智能强力驱动
Strongly Powered by AbleSci AI