结直肠癌
梯度升压
机器学习
随机森林
医学
Boosting(机器学习)
人工智能
癌症
接收机工作特性
计算机科学
家族史
肿瘤科
内科学
作者
Ling Zheng,Elijah Eniola,Jiacun Wang
标识
DOI:10.1109/iccsi53130.2021.9736248
摘要
Colorectal cancer is the third most prevalent cancer and the second most common cause of cancer deaths in the United States. Screening is one of the most powerful tools for colorectal cancer prevention. Current screening recommendations are only based on history of colorectal cancer and age. To facilitate a more effective screening of color ectal cancer, this paper explores the feasibility of machine learning algorithms for the colorectal cancer risk prediction. The longitudinal Pancreatic, Lung, Colorectal, Ovarian Cancer dataset from the National Cancer Institute was utilized for the training and testing of eight machine learning algorithms. The experiment results show that the gradient boosting model has the largest area under the Receiver Operating Characteristics curve 0.82, and the random forest model has the highest accuracy 0.75, highest recall 0.76 and highest F1 score 0.75. The two optimal models were also used to evaluate the importance of top risk factors, which are helpful for a more effective screening recommendation.
科研通智能强力驱动
Strongly Powered by AbleSci AI