清晨好,您是今天最早来到科研通的研友!由于当前在线用户较少,发布求助请尽量完整地填写文献信息,科研通机器人24小时在线,伴您科研之路漫漫前行!

Explainable machine learning for chronic lymphocytic leukemia treatment prediction using only inexpensive tests

医学 慢性淋巴细胞白血病 人口 预期寿命 疾病 机器学习 人工智能 内科学 重症监护医学 白血病 计算机科学 环境卫生
作者
Amiel Meiseles,Denis Paley,Mira Ziv,Yarin Hadid,Lior Rokach,Tamar Tadmor
出处
期刊:Computers in Biology and Medicine [Elsevier]
卷期号:145: 105490-105490 被引量:11
标识
DOI:10.1016/j.compbiomed.2022.105490
摘要

Chronic lymphocytic leukemia (CLL) is one of the most common types of leukemia in the western world which affects mainly the elderly population. Progress of the disease is very heterogeneous both in terms of necessity of treatment and life expectancy. The current scoring system for prognostic evaluation of patients with CLL is called CLL-IPI and predicts the general progress of the disease but is not a measure or a decision aid for the necessity of treatment. Due to the heterogeneous behavior of CLL it is important to develop tools that will identify if and when patients will necessitate treatment for CLL. Recently, Machine Learning (ML) has spread to many public health fields including diagnosis and prognosis of diseases. Existing machine learning methods for CLL treatment prediction rely on expensive tests, such as genetic tests, rendering them useless in peripheral or low-resource clinics such as those in developing countries. We aim to develop a model for predicting whether a patient will need treatment for CLL within two years of diagnosis using a machine learning model based on only on demographic data and routine laboratory tests. We conducted a single center study that included adult patients (above the age of 18) that were diagnosed with CLL according to the IWCLL criteria and were under observation at the hematology unit of the Bnai-Zion medical center between 2009 and 2019. Patient data include demographic, clinical and laboratory measures that were extracted from patients’ medical records anonymously. All laboratory results, during the observation period, were extracted for the entire cohort. Multiple ML approaches for classifying whether a patient will require treatment during a predetermined period of 2 years were evaluated. Performance of the ML models was measured using repeated cross validation. We evaluated the use of SHapley Additive exPlanation (SHAP) for explaining what influences the models decision. Additionally, we employ a method for extracting a single decision tree from the ML model which enables the doctor to understand the main logic governing the model prediction. The study included 109 patients of them 67 males (61%). Patients were under observation for a median of 44 months and the median age was 65 (age range: 45–87). 64% of the cohort received therapy during follow-up. A Gradient Boosting Model (GBM) model using all of the extracted variables to identify the need for treatment in the coming two years among patients with CLL achieved the AUPRC of 0.78 (±0.08). An identical GBM model, without genetic/FISH and flowcytometry (FACS) data, such that it can be used in peripheral clinics, scored an AUPRC of 0.7686 (±0.0837). A Generalized Linear Model (GLM) using the same features, scored an AUPRC of 0.7535 (±0.0995). All the models described above surpassed the performance of CLL-IPI that was evaluated using the CLL-TIM model. According to the SHAP results, red blood cell (RBC) count was the most predictive value for the necessity for treatment, where a high value is associated with a low probability of requiring treatment in the coming two years. Additionally, the SHAP method was used for estimating the personal risk of a random patient and showed sensible results. A simple Decision Tree classifier showed that patients who had a hemoglobin level of less than 13 gm/dL and a Neutrophil to Lymphocyte Ratio (NLR) less than 0.063, which constituted 34% percent of the patients included in our study, had a high probability (76%) of requiring treatment. Machine Learning algorithms that were evaluated in this work for predicting the necessity of treatment for patients with CLL achieved reasonable accuracy which surpassed that of CLL-IPI which was evaluated using the CLL-TIM model. Furthermore, we found that a machine learning model trained exclusively using inexpensive features only incurred a modest decrease in performance compared to the model trained using all of the features. Due to the small number of patients in this study it is necessary to validate the results on a larger population. • Machine Learning methods are effective in predicting CLL treatment. • Gradient Boosting Machines perform best among Machine Learning models. • Using only inexpensive features does not greatly reduce predictive performance. • Our model can be explained using a decision tree surrogate model.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
3秒前
yushiolo完成签到 ,获得积分10
9秒前
hy完成签到 ,获得积分10
13秒前
江東完成签到 ,获得积分10
16秒前
elisa828完成签到,获得积分10
17秒前
18秒前
忧虑的静柏完成签到 ,获得积分10
25秒前
yukky发布了新的文献求助10
28秒前
29秒前
离蒲完成签到 ,获得积分10
33秒前
Ava应助Sandy采纳,获得10
34秒前
39秒前
瘦瘦的枫叶完成签到 ,获得积分10
42秒前
龚瑶完成签到 ,获得积分10
44秒前
乐观的忆枫完成签到 ,获得积分10
45秒前
量子星尘发布了新的文献求助10
45秒前
贰壹完成签到 ,获得积分10
46秒前
coding完成签到,获得积分10
51秒前
baiye完成签到,获得积分10
51秒前
王波完成签到 ,获得积分10
52秒前
DZS完成签到 ,获得积分10
55秒前
临兵者完成签到 ,获得积分10
56秒前
zhang5657完成签到,获得积分10
1分钟前
一天完成签到 ,获得积分10
1分钟前
knight7m完成签到 ,获得积分10
1分钟前
奋斗的妙海完成签到 ,获得积分0
1分钟前
婉莹完成签到 ,获得积分0
1分钟前
魔幻的从丹完成签到 ,获得积分10
1分钟前
自觉曲奇完成签到 ,获得积分10
1分钟前
zw完成签到,获得积分10
1分钟前
zhangshenrong完成签到 ,获得积分10
1分钟前
安心完成签到 ,获得积分10
2分钟前
Kelly Lau发布了新的文献求助10
2分钟前
小小虾完成签到 ,获得积分10
2分钟前
CCCCCL完成签到,获得积分10
2分钟前
长柏完成签到 ,获得积分10
2分钟前
高高的从波完成签到,获得积分10
2分钟前
sunwsmile完成签到 ,获得积分10
2分钟前
Peter完成签到 ,获得积分10
2分钟前
qin202569完成签到,获得积分10
2分钟前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Encyclopedia of Forensic and Legal Medicine Third Edition 5000
Introduction to strong mixing conditions volume 1-3 5000
Agyptische Geschichte der 21.30. Dynastie 3000
Aerospace Engineering Education During the First Century of Flight 2000
从k到英国情人 1700
„Semitische Wissenschaften“? 1510
热门求助领域 (近24小时)
化学 材料科学 生物 医学 工程类 计算机科学 有机化学 物理 生物化学 纳米技术 复合材料 内科学 化学工程 人工智能 催化作用 遗传学 数学 基因 量子力学 物理化学
热门帖子
关注 科研通微信公众号,转发送积分 5773271
求助须知:如何正确求助?哪些是违规求助? 5608981
关于积分的说明 15430729
捐赠科研通 4905828
什么是DOI,文献DOI怎么找? 2639838
邀请新用户注册赠送积分活动 1587741
关于科研通互助平台的介绍 1542719