Comparison between linear regression and four different machine learning methods in selecting risk factors for osteoporosis in a Chinese female aged cohort

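Keywords: Medicine, Machine learning, Statistics, Linear regression, Artificial intelligence, Body mass index, Osteoporosis, Random forest, Regression, Boosting (machine learning), Mean squared error, Mathematics, Computer science, Internal medicine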
Authors
Shiow-Jyu Tzou, Chung-Hsin Peng, Li-Ying Huang, Fang-Yu Chen, Chun-Heng Kuo, Chung-Ze Wu, Ta-Wei Chu
Source
Journal: Journal of The Chinese Medical Association [Lippincott Williams & Wilkins]
Volume and issue: 86 (11): 1028-1036. Cited by: 1
Identifier
DOI:10.1097/jcma.0000000000000999
Abstract

Background: Population aging is an increasingly acute challenge for countries around the world. One particular manifestation of this phenomenon is the impact of osteoporosis on individuals and national health systems. Previous studies of risk factors for osteoporosis were conducted using traditional statistical methods, but more recent efforts have turned to machine learning approaches. Most such efforts, however, treat the target variable (bone mineral density [BMD] or fracture rate) as a categorical one, which provides no quantitative information. The present study uses five different machine learning methods to analyze the risk factors for T-score of BMD, seeking to (1) compare the prediction accuracy between different machine learning methods and traditional multiple linear regression (MLR) and (2) rank the importance of 25 different risk factors.

Methods: The study sample includes 24,412 women older than 55 years with 25 related variables, applying traditional MLR and five different machine learning methods: classification and regression tree, Naïve Bayes, random forest, stochastic gradient boosting, and eXtreme gradient boosting. The metrics used for model performance comparisons are the symmetric mean absolute percentage error, relative absolute error, root relative squared error, and root mean squared error.

Results: Machine learning approaches outperformed MLR for all four prediction errors. The average importance ranking of each factor generated by the machine learning methods indicates that age is the most important factor determining T-score, followed by estimated glomerular filtration rate (eGFR), body mass index (BMI), uric acid (UA), and education level.

Conclusion: In a group of women older than 55 years, we demonstrated that machine learning methods provide superior performance in estimating T-score, with age being the most influential factor, followed by eGFR, BMI, UA, and education level.
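
The four error metrics named in the Methods can be illustrated with a minimal Python sketch. The abstract does not give the formulas, so the definitions below are commonly used textbook forms and are an assumption, not necessarily the exact variants used in the study. The function names and the T-score values are hypothetical and for illustration only.

```python
import math


def smape(actual, predicted):
    """Symmetric mean absolute percentage error, in percent.

    Assumed variant: mean of |a - p| / ((|a| + |p|) / 2), times 100.
    Pairs where both values are zero contribute zero error.
    """
    terms = []
    for a, p in zip(actual, predicted):
        denom = (abs(a) + abs(p)) / 2
        terms.append(0.0 if denom == 0 else abs(a - p) / denom)
    return 100 * sum(terms) / len(terms)


def rae(actual, predicted):
    """Relative absolute error: total absolute error divided by the
    total absolute error of always predicting the mean of `actual`."""
    mean_a = sum(actual) / len(actual)
    model_err = sum(abs(a - p) for a, p in zip(actual, predicted))
    baseline_err = sum(abs(a - mean_a) for a in actual)
    return model_err / baseline_err


def rrse(actual, predicted):
    """Root relative squared error: square root of the squared error
    divided by the squared error of the mean-only baseline."""
    mean_a = sum(actual) / len(actual)
    model_err = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    baseline_err = sum((a - mean_a) ** 2 for a in actual)
    return math.sqrt(model_err / baseline_err)


def rmse(actual, predicted):
    """Root mean squared error, in the units of the target (T-score)."""
    squared = [(a - p) ** 2 for a, p in zip(actual, predicted)]
    return math.sqrt(sum(squared) / len(squared))


# Made-up T-score values for illustration; NOT data from the study.
actual = [-1.0, -2.0, -3.0, -2.0]
predicted = [-1.5, -2.0, -2.5, -2.0]

for name, fn in [("SMAPE", smape), ("RAE", rae), ("RRSE", rrse), ("RMSE", rmse)]:
    print(f"{name}: {fn(actual, predicted):.4f}")
```

Under these definitions, lower values are better for all four metrics, and RAE or RRSE below 1 means the model beats a baseline that always predicts the mean of the observed values.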
