Application of machine learning approaches for osteoporosis risk prediction in postmenopausal women

机器学习人工智能决策树医学随机森林支持向量机接收机工作特性逻辑回归骨质疏松症梯度升压预测建模内科学计算机科学

作者

Jae‐Geum Shim,Dong Woo Kim,Kyoung-Ho Ryu,Eun-Ah Cho,Jin Hee Ahn,Jeong-In Kim,Sung Hyun Lee

出处

期刊：Archives of Osteoporosis [Springer Science+Business Media]
日期：2020-10-23 卷期号：15 (1) 被引量：66

标识

摘要

Many predictive tools have been reported for assessing osteoporosis risk. The development and validation of osteoporosis risk prediction models were supported by machine learning. Osteoporosis is a silent disease until it results in fragility fractures. However, early diagnosis of osteoporosis provides an opportunity to detect and prevent fractures. We aimed to develop machine learning approaches to achieve high predictive ability for osteoporosis risk that could help primary care providers identify which women are at increased risk of osteoporosis and should therefore undergo further testing with bone densitometry. We included all postmenopausal Korean women from the Korea National Health and Nutrition Examination Surveys (KNHANES V-1, V-2) conducted in 2010 and 2011. Machine learning models using methods such as the k-nearest neighbors (KNN), decision tree (DT), random forest (RF), gradient boosting machine (GBM), support vector machine (SVM), artificial neural networks (ANN), and logistic regression (LR) were developed to predict osteoporosis risk. We analyzed the effect of applying the machine learning algorithms to the raw data and featuring the selected data only where the statistically significant variables were included as model inputs. The accuracy, sensitivity, specificity, and area under the receiver operating characteristic curve (AUROC) were used to evaluate performance among the seven models. A total of 1792 patients were included in this study, of which 613 had osteoporosis. The raw data consisted of 19 variables and achieved performances (in terms of AUROCs) of 0.712, 0.684, 0.727, 0.652, 0.724, 0.741, and 0.726 for KNN, DT, RF, GBM, SVM, ANN, and LR with fivefold cross-validation, respectively. The feature selected data consisted of nine variables and achieved performances (in terms of AUROCs) of 0.713, 0.685, 0.734, 0.728, 0.728, 0.743, and 0.727 for KNN, DT, RF, GBM, SVM, ANN, and LR with fivefold cross-validation, respectively. In this study, we developed and compared seven machine learning models to accurately predict osteoporosis risk. The ANN model performed best when compared to the other models, having the highest AUROC value. Applying the ANN model in the clinical environment could help primary care providers stratify osteoporosis patients and improve the prevention, detection, and early treatment of osteoporosis.

求助该文献

最长约 10秒，即可获得该文献文件

Application of machine learning approaches for osteoporosis risk prediction in postmenopausal women

今日热心研友