雅卡索引
高光谱成像
算法
数学
理论(学习稳定性)
粗集
维数之咒
摄动(天文学)
冗余(工程)
模式识别(心理学)
计算机科学
人工智能
数据挖掘
机器学习
物理
量子力学
操作系统
作者
Yao Liu,Junjie Yang,Yuehua Chen,Kezhu Tan,Liguo Wang,Xiaozhen Yan
标识
DOI:10.1016/j.chemolab.2017.08.005
摘要
Band selection is a well-known approach for reducing the dimensionality of hyperspectral data. When the neighborhood rough set theory is used to select informative bands, different criteria of the band selection algorithms may lead to different optimal band subsets. Many studies have been analyzed the classification performance of band selection algorithms and have demonstrated that different algorithms are similar for classification. Therefore, rather than evaluating band selection algorithms using only classification accuracy, their stability should also be explored. The stability of an algorithm, which is quantified by the sensitivity of the algorithm to variations in the training set, is a topic of recent interest. Most studies on stability compare the band subsets chosen either from perturbation datasets by randomly removing methods or from perturbation datasets by cross validation methods. These methods either result in an unknown degree of overlap between perturbation datasets, or an invariable degree of overlap. In this work, we propose an adjustable degree of overlap method to construct perturbation datasets, which can set different levels of perturbation. By introducing the Jaccard index as a metric of stability, we explore the stability of six band selection algorithms based on the neighborhood rough set theory. We experimentally demonstrate that the level of perturbation, the degree of overlap, the size of the subset, and the size of the neighborhood affect stability. The results show that the maximal relevance minimal redundancy difference band selection algorithm has the greatest stability overall and better classification ability.
科研通智能强力驱动
Strongly Powered by AbleSci AI