糖尿病
Lasso(编程语言)
随机森林
接收机工作特性
试验装置
医学
机器学习
人工智能
回归
内科学
计算机科学
统计
数学
内分泌学
万维网
作者
Hongcheng Wei,Jie Sun,Wenqi Shan,Wenwen Xiao,Bingqian Wang,Xuan Ma,Weiyue Hu,Xinru Wang,Yankai Xia
标识
DOI:10.1016/j.scitotenv.2021.150674
摘要
With dramatically increasing prevalence, diabetes mellitus has imposed a tremendous toll on individual well-being. Humans are exposed to various environmental chemicals, which have been postulated as underappreciated but potentially modifiable diabetes risk factors. To determine the utility of environmental chemical exposure in predicting diabetes mellitus. A total of 8501 eligible participants from NHANES 2005–2016 were randomly assigned to a discovery (N = 5953) set and a validation (N = 2548) set. We applied random forest (RF) and least absolute shrinkage and selection operator (LASSO) regression with 10-fold cross-validation in the discovery set to select features, and built an optimal model to predict diabetes mellitus, blood insulin, fasting plasma glucose (FPG) and 2-h plasma glucose after oral glucose tolerance test (2-h PG after OGTT). The machine learning model using LASSO regression predicted diabetes with an area under the receiver operating characteristics (AUROC) of 0.80 and 0.78 in the discovery set and validation set, respectively. The linear model predicted blood insulin level with an R2 of 0.42 and 0.40 in the discovery set and validation set, respectively. For FPG, the discovery set and validation set yielded an R2 of 0.16 and 0.15, respectively. For 2-h PG after OGTT, the discovery set and validation set yielded an R2 of 0.18 and 0.17, respectively. We used environmental chemical exposure, constructed machine learning models and achieved relatively accurate prediction for diabetes, emphasizing the predictive value of widespread environmental chemicals for complicated diseases.
科研通智能强力驱动
Strongly Powered by AbleSci AI