随机森林
机器学习
人工智能
计算机科学
集成学习
支持向量机
混淆矩阵
人工神经网络
决策树
集合预报
投票
分类器(UML)
逻辑回归
政治
政治学
法学
作者
N.K. Gupta,Baij Nath Kaushik
标识
DOI:10.1093/comjnl/bxab145
摘要
Abstract There has been an increase in occurrence of human diseases all over the world. Among those, Breast Cancer has increased with an alarming rate in the past decade and this trend of increase would continue to grow. Now, there is a need for efficient text analytics and feature extraction tools to assist classifying, sharing and retrieving the information on human diseases in general and Breast Cancer in particular. In light of above, the present study has been undertaken with the objective to provide a comparative analysis of different classifiers on Breast Cancer dataset, and to propose a new ensemble training method of Machine Learning Classification. Here, machine learning models (such as K-Nearest Neighbour, Logistic Regression, Decision Tree, Random Forest, Gradient Boost, Support Vector Machine) and deep learning classifiers (such as Multi-Layer Feed Forward Neural Network, Recurrent Neural Network and Long Short Term Memory) have been applied on Breast Cancer dataset. An Ensemble Learning model for Prediction is proposed to classify the results among different classifiers. Finally, the Voting Ensemble is implemented to find out the optimal classifier for prediction of Breast Cancer. The results have been computed using the evaluation parameters such as Accuracy, Precision, Recall and Specificity. The confusion matrix drawn on the basis of evaluation parameters provides more emphasis on predicted and actual instances. Performance Evaluation for various machine learning models is computed. Results of this investigation concludes that Voting Ensemble outperforms other machine learning models. The prediction using Voting Ensemble resulted in an accuracy rate of 97.9 per cent, precision of 96.77 per cent and recall of 100 per cent.
科研通智能强力驱动
Strongly Powered by AbleSci AI