共线性
变量消去
线性回归
算法
特征选择
逐步回归
变量(数学)
数学
计算机科学
回归分析
普通最小二乘法
变量
回归
统计
人工智能
数学分析
推论
作者
Roberto Kawakami Harrop Galvão,Mário César Ugulino de Araújo,Wallace D. Fragoso,Edvan Cirino Silva,Gledson Emidio José,Sófacles Figueredo Carreiro Soares,Henrique Mohallem Paiva
标识
DOI:10.1016/j.chemolab.2007.12.004
摘要
The successive projections algorithm (SPA) is a variable selection technique designed to minimize collinearity problems in multiple linear regression (MLR). This paper proposes a modification to the basic SPA formulation aimed at further improving the parsimony of the resulting MLR model. For this purpose, an elimination procedure is incorporated to the algorithm in order to remove variables that do not effectively contribute towards the prediction ability of the model as indicated by an F-test. The utility of the proposed modification is illustrated in a simulation study, as well as in two application examples involving the analysis of diesel and corn samples by near-infrared (NIR) spectroscopy. The results demonstrate that the number of variables selected by SPA can be reduced without significantly compromising prediction performance. In addition, SPA is favourably compared with classic Stepwise Regression and full-spectrum PLS. A graphical user interface for SPA is available at www.ele.ita.br/∼kawakami/spa/.
科研通智能强力驱动
Strongly Powered by AbleSci AI