特征选择
随机森林
人工智能
模式识别(心理学)
分类器(UML)
Boosting(机器学习)
计算机科学
特征(语言学)
梯度升压
机器学习
语言学
哲学
作者
Yuchun Liu,Zhihui Liu,Xue Luo,Hongjingtian Zhao
标识
DOI:10.1016/j.bbe.2022.06.007
摘要
To address the problem of high feature dimensionality of Parkinson's disease medical data, this paper introduces SHapley Additive exPlanations (SHAP) value for feature selection of Parkinson's disease medical dataset. This paper combines SHAP value with four classifiers, namely deep forest (gcForest), extreme gradient boosting (XGBoost), light gradient boosting machine (LightGBM) and random forest (RF), respectively. Then this paper applies them to Parkinson's disease diagnosis. First, the classifier is used to calculate the magnitude of contribution of SHAP value to the features, then the features with significant contribution in the classification task are selected, and then the data after feature selection is used as input to classify the Parkinson's disease dataset for diagnosis using the classifier. The experimental results show that compared to Fscore, analysis of variance (Anova-F) and mutual information (MI) feature selection methods, the four models based on SHAP-value feature selection achieved good classification results. The SHAP-gcForest model combined with gcForest achieves classification accuracy of 91.78% and F1-score of 0.945 when 150 features are selected. The SHAP-LightGBM model combined with LightGBM achieves classification accuracy and F1-score of 91.62% and 0.945 when 50 features are selected, respectively. The classification effectiveness is second only to the SHAP-gcForest model, but the SHAP-LightGBM model is more computationally efficient than the SHAP-gcForest model. Finally, the effectiveness of the proposed method is verified by comparing it with the results of existing literature. The findings demonstrate that machine learning with SHAP value feature selection method has good classification performance in the diagnosis of Parkinson's disease, and provides a reference for physicians in the diagnosis and prevention of Parkinson's disease.
科研通智能强力驱动
Strongly Powered by AbleSci AI