Disability risk prediction model based on machine learning among Chinese healthy older adults: results from the China Health and Retirement Longitudinal Study

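Machine learning; Logistic regression; Random forest; Receiver operating characteristic; Artificial intelligence; Naive Bayes classifier; Longitudinal study; Medicine; Lasso (programming language); Psychological intervention; Multilayer perceptron; Gerontology; Artificial neural network; Computer science; Support vector machine; Psychiatry; World Wide Web; Pathology
Authors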
Yuchen Han,Shaobing Wang
Source
Journal: Frontiers in Public Health [Frontiers Media]
Volume: 11
Identifier
DOI:10.3389/fpubh.2023.1271595
Abstract

Background: Predicting disability risk in healthy older adults in China is essential for timely preventive interventions, for improving quality of life, and for providing scientific evidence for disability prevention. A machine learning model that can evaluate disability risk from longitudinal data is therefore needed.

Methods: We conducted a prospective cohort study of 2,175 older adults enrolled in the China Health and Retirement Longitudinal Study (CHARLS) between 2015 and 2018 to develop and validate the prediction model. Six machine learning algorithms (logistic regression, k-nearest neighbors, naive Bayes, multilayer perceptron, random forest, and XGBoost) were used to assess the 3-year risk of developing disability. Optimal cutoff points and tuning parameters were explored in the training set, prediction accuracy was compared in the testing set, and the best-performing models were further interpreted.

Results: During the 3-year follow-up, 505 (23.22%) of the healthy older adults developed disability. Of the 43 features examined, LASSO regression selected 11 for model building. Among the six models compared on the testing set, XGBoost performed best, with the highest area under the ROC curve (0.803), accuracy (0.757), sensitivity (0.790), and F1 score (0.789); its specificity was 0.712. Decision curve analysis (DCA) showed that XGBoost had the highest net benefit across most of the threshold range. Ranked by SHAP (a model interpretation method) feature importance, the top five features were right-hand grip strength, depressive symptoms, marital status, respiratory function, and age. The SHAP summary plot illustrated the positive or negative contribution of each feature to the XGBoost prediction, and the SHAP dependence plot showed how individual features affected the model output.

Conclusion: Machine learning-based prediction models can accurately evaluate the likelihood of disability in healthy older adults over a period of 3 years. Combining XGBoost with SHAP can provide clear explanations for personalized risk prediction and a more intuitive understanding of the effect of key features in the model.
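
The evaluation metrics named in the abstract (accuracy, sensitivity, specificity, F1 score, and the net benefit used in decision curve analysis) can all be derived from the confusion matrix of a classifier at a given cutoff. The following is a minimal sketch in plain Python, not the authors' code: the class and function names are hypothetical, and the counts in the usage example are invented for illustration and are not taken from the study. The net benefit formula is the standard one, TP/n minus FP/n weighted by the odds of the threshold probability.

```python
from dataclasses import dataclass


@dataclass
class Confusion:
    """Counts from a binary classifier at one probability cutoff (hypothetical helper)."""
    tp: int  # developed disability, predicted high risk
    fp: int  # no disability, predicted high risk
    tn: int  # no disability, predicted low risk
    fn: int  # developed disability, predicted low risk

    @property
    def n(self) -> int:
        return self.tp + self.fp + self.tn + self.fn


def summary_metrics(c: Confusion) -> dict:
    """Return accuracy, sensitivity, specificity, and F1 for one confusion matrix."""
    sensitivity = c.tp / (c.tp + c.fn)
    specificity = c.tn / (c.tn + c.fp)
    precision = c.tp / (c.tp + c.fp)
    return {
        "accuracy": (c.tp + c.tn) / c.n,
        "sensitivity": sensitivity,
        "specificity": specificity,
        "f1": 2 * precision * sensitivity / (precision + sensitivity),
    }


def net_benefit(c: Confusion, threshold: float) -> float:
    """Net benefit at a threshold probability, as plotted in decision curve analysis.

    The confusion matrix should be the one obtained when individuals with a
    predicted risk at or above `threshold` are classified as high risk.
    """
    if not 0.0 < threshold < 1.0:
        raise ValueError("threshold must be strictly between 0 and 1")
    odds = threshold / (1.0 - threshold)
    return c.tp / c.n - (c.fp / c.n) * odds


def net_benefit_treat_all(prevalence: float, threshold: float) -> float:
    """Reference curve: net benefit of classifying everyone as high risk."""
    if not 0.0 < threshold < 1.0:
        raise ValueError("threshold must be strictly between 0 and 1")
    return prevalence - (1.0 - prevalence) * threshold / (1.0 - threshold)


if __name__ == "__main__":
    # Invented counts for illustration only.
    example = Confusion(tp=80, fp=30, tn=70, fn=20)
    for name, value in summary_metrics(example).items():
        print(f"{name}: {value:.3f}")
    print(f"net benefit at 0.2: {net_benefit(example, 0.2):.4f}")
```

A decision curve repeats the `net_benefit` calculation over a range of thresholds, recomputing the confusion matrix at each one, and compares the model's curve with the "treat all" and "treat none" (net benefit of zero) references.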
