冗余(工程)
特征选择
模式识别(心理学)
人工智能
最小冗余特征选择
计算机科学
随机森林
相关性(法律)
主成分分析
相互信息
数据挖掘
政治学
操作系统
法学
作者
Xiaohui Ma,Zhengguang Chen,Jinming Liu
标识
DOI:10.1016/j.saa.2024.123933
摘要
Near-infrared spectroscopy (NIRS) is a rapid, nondestructive analytical technique utilized in various fields. However, the NIR data, which consists of hundreds of dimensions, may exhibit considerable duplication in the spectrum information. This redundancy might impair modeling effectiveness. As a result, feature selection on the spectral data becomes critical. The Max-Relevance Min-Redundancy (mRMR) method stands out among the different feature selection techniques for dimensional reduction. The approach depends on mutual information (MI) between random variables as the basis for feature selection and is unaffected by modeling methods. However, it is necessary to clarify the benefits of the maximum correlation minimal redundancy algorithm in the context of near-infrared spectral feature selection, as well as its adaptability to various modeling methods. This research focuses on the NIR spectral dataset of maize germination rate, and the mRMR method is utilized to select spectral features. Based on the preceding foundation, we create models for Support Vector Regression, Gaussian Process Regression, Random Forest, and Neural Networks. The experimental findings demonstrate that, among the feature selection methods employed in this paper, the Max-Relevance Min-Redundancy algorithm outperforms others regarding the corn germination rate dataset.
科研通智能强力驱动
Strongly Powered by AbleSci AI