Setting up of a machine learning algorithm for the identification of severe liver fibrosis profile in the general US population cohort

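Algorithm, Medicine, Population, Computer science, Machine learning, Hepatology, Artificial intelligence, Internal medicine, Environmental health
Authors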
Samir Hassoun,Chiara Bruckmann,Stefano Ciardullo,Gianluca Perseghin,Francesca Di Gaudio,Francesco Broccolo
Source
Journal: International Journal of Medical Informatics [Elsevier BV]
Volume/Issue: 170: 104932-104932 Cited by: 7
Identifier
DOI:10.1016/j.ijmedinf.2022.104932
Abstract

The progress of digital transformation in clinical practice opens the door to shifting the current clinical pathway for liver disease diagnosis from a late-stage approach to an early-stage one. Early diagnosis of liver fibrosis can prevent progression of the disease and decrease liver-related morbidity and mortality. We developed a machine learning (ML) algorithm based on standard parameters that can identify liver fibrosis in the general US population. Starting from a public database (National Health and Nutrition Examination Survey, NHANES), representative of the American population, with 7265 eligible subjects (control population n = 6828, with Fibroscan values E < 9.7 kPa; target population n = 437, with Fibroscan values E ≥ 9.7 kPa), we set up an SVM algorithm able to discriminate individuals with liver fibrosis within the general US population. The algorithm setup involved the removal of missing data and a sampling optimization step to manage the data imbalance (only ∼ 5 % of the dataset is the target population). For feature selection, we performed an unbiased analysis, starting from 33 clinical, anthropometric, and biochemical parameters regardless of their previous application as biomarkers of liver diseases. Through PCA, we identified the 26 most significant features and then used them to set up a sampling method on an SVM algorithm. The best sampling technique to manage the data imbalance was found to be oversampling with SMOTE-NC. For final model validation, we used a subset of 300 individuals (150 with liver fibrosis and 150 controls), withheld from the main dataset prior to sampling. Performance was evaluated on multiple independent runs. We provide proof of concept of an ML clinical decision support tool for liver fibrosis diagnosis in the general US population.
Though the presented ML model represents at this stage only a prototype, in the future, it might be implemented and potentially applied to program broad screenings for liver fibrosis.
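
The order of operations described in the abstract (set aside a balanced validation subset first, then oversample only the remaining data) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' code: the data are synthetic, the function names `split_holdout` and `smote_like` are hypothetical, and the oversampler is a simplified SMOTE-style interpolation for numeric features only, whereas the abstract reports SMOTE-NC, which also handles categorical features. Only the subject counts (6828 controls, 437 targets, 150 + 150 validation subjects) come from the abstract.

```python
import random


def split_holdout(rows, labels, per_class, rng):
    """Set aside `per_class` subjects of each class before any resampling."""
    holdout_idx = []
    for cls in (0, 1):
        idx = [i for i, y in enumerate(labels) if y == cls]
        holdout_idx.extend(rng.sample(idx, per_class))
    held = set(holdout_idx)
    train = [(rows[i], labels[i]) for i in range(len(rows)) if i not in held]
    test = [(rows[i], labels[i]) for i in sorted(held)]
    return train, test


def smote_like(minority, n_new, k, rng):
    """Simplified SMOTE (assumption: numeric features only).

    Each synthetic row lies on the segment between a random minority row
    and one of its k nearest minority neighbours.
    """
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        neighbours = sorted(
            (m for m in minority if m is not base),
            key=lambda m: sum((a - b) ** 2 for a, b in zip(base, m)),
        )[:k]
        other = rng.choice(neighbours)
        gap = rng.random()  # interpolation weight in [0, 1)
        synthetic.append([a + gap * (b - a) for a, b in zip(base, other)])
    return synthetic


rng = random.Random(0)

# Synthetic stand-in for the cohort: two made-up numeric features per subject.
rows = [[rng.gauss(0, 1), rng.gauss(0, 1)] for _ in range(6828)]   # controls
rows += [[rng.gauss(1, 1), rng.gauss(1, 1)] for _ in range(437)]   # targets
labels = [0] * 6828 + [1] * 437

# 1) Withhold 150 + 150 subjects for final validation, before oversampling.
train, test = split_holdout(rows, labels, per_class=150, rng=rng)

# 2) Oversample the minority class in the training portion only.
minority = [x for x, y in train if y == 1]
majority = [x for x, y in train if y == 0]
new_rows = smote_like(minority, len(majority) - len(minority), k=5, rng=rng)
balanced = train + [(x, 1) for x in new_rows]

print(len(test), len(minority), len(majority), len(balanced))
```

Splitting before oversampling keeps synthetic rows out of the validation subset, so the 300 withheld subjects contain only real observations. In practice a library implementation of SMOTE-NC and an SVM classifier would replace the hand-written pieces here.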
