特征选择
随机森林
单变量
接收机工作特性
决策树
结直肠癌
人工智能
医学
逻辑回归
支持向量机
机器学习
多元统计
肿瘤科
预测建模
内科学
计算机科学
癌症
作者
Weiyuan Zhang,Xu Guan,Shuai Jiao,Guiyu Wang,Xishan Wang
出处
期刊:Ejso
[Elsevier]
日期:2023-12-01
卷期号:49 (12): 107107-107107
标识
DOI:10.1016/j.ejso.2023.107107
摘要
Background To assist clinicians with diagnosis and optimal treatment decision-making, we attempted to develop and validate an artificial intelligence prediction model for lung metastasis (LM) in colorectal cancer (CRC) patients. Methods The clinicopathological characteristics of 46037 CRC patients from the Surveillance, Epidemiology, and End Results (SEER) database and 2779 CRC patients from a multi-center external validation set were collected retrospectively. After feature selection by univariate and multivariate analyses, six machine learning (ML) models, including logistic regression, K-nearest neighbor, support vector machine, decision tree, random forest, and balanced random forest (BRF), were developed and validated for the LM prediction. In addition, stratified LM patients by risk score were utilized for survival analysis. Results Extremely low rates of LM with 2.59% and 4.50% were present in the development and validation set. As the imbalanced learning strategy, the BRF model with an Area under the receiver operating characteristic curve (AUC) of 0.874 and an average precision (AP) of 0.184 performed best compares with other models and clinical predictor. Patients with LM in the high-risk group had significantly poorer survival (P<0.001) and failed to benefit from resection (P = 0.125). Conclusions In summary, we have utilized the BRF algorithm to develop an effective, non-invasive, and practical model for predicting LM in CRC patients based on highly imbalanced datasets. In addition, we have implemented a novel approach to stratify the survival risk of CRC patients with LM based the output of the model.
科研通智能强力驱动
Strongly Powered by AbleSci AI