Ensemble Learning for Disease Prediction: A Review

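Keywords: Ensemble learning; Boosting (machine learning); Machine learning; Computer science; Artificial intelligence; Ensemble forecasting; Stacking; Classifier (UML); Disease; Random forest; Medicine; Pathology; Physics; Nuclear magnetic resonance
Authors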
Palak Mahajan,Shahadat Uddin,Farshid Hajati,Mohammad Ali Moni
Source
Journal: Healthcare [Multidisciplinary Digital Publishing Institute]
Volume (Issue): 11 (12): 1808-1808 Cited by: 35
Identifier
DOI:10.3390/healthcare11121808
Abstract

Machine learning models are used to build and improve a wide range of disease prediction frameworks. Ensemble learning is a machine learning technique that combines multiple classifiers to make more accurate predictions than a single classifier. Although numerous studies have employed ensemble approaches for disease prediction, commonly used ensemble approaches have not been thoroughly assessed against highly researched diseases. This study therefore aims to identify significant trends in the accuracy of four ensemble techniques (bagging, boosting, stacking, and voting) across five heavily researched diseases (diabetes, skin disease, kidney disease, liver disease, and heart conditions). Using a well-defined search strategy, we first identified 45 articles from the current literature that applied two or more of the four ensemble approaches to any of these five diseases and were published in 2016-2023. Although stacking was used the fewest times (23) compared with bagging (41) and boosting (37), it most often showed the most accurate performance (19 out of 23 times). Voting was the second-best ensemble approach in this review. Stacking always showed the most accurate performance in the reviewed articles for skin disease and diabetes. Bagging performed best for kidney disease (five out of six times) and boosting for liver disease and diabetes (four out of six times). Overall, the results show that stacking has been more accurate for disease prediction than the other three candidate approaches. Our study also demonstrates variability in the perceived performance of different ensemble approaches against frequently used disease datasets.
The findings of this work will assist researchers in better understanding current trends and hotspots in disease prediction models that employ ensemble learning, as well as in determining a more suitable ensemble model for predictive disease analytics.
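
The abstract's central premise, that combining several classifiers can outperform any single one, can be illustrated with a minimal sketch of hard majority voting, one of the four ensemble approaches named above. This is not code from the reviewed article: the labels, the three classifier names (`clf_a`, `clf_b`, `clf_c`), and their predictions are hypothetical toy data, deliberately chosen so that no two classifiers err on the same sample. Real gains depend on how diverse the base classifiers' errors actually are.

```python
from collections import Counter

# Hypothetical ground-truth labels (1 = disease present, 0 = absent).
y_true = [1, 0, 1, 1, 0, 0, 1, 0, 1, 0]

# Hypothetical predictions from three base classifiers.
# Each one misclassifies two samples, and no two err on the same sample.
preds = {
    "clf_a": [1, 0, 1, 0, 0, 0, 1, 0, 1, 1],  # wrong on samples 3 and 9
    "clf_b": [1, 1, 1, 1, 0, 0, 0, 0, 1, 0],  # wrong on samples 1 and 6
    "clf_c": [0, 0, 1, 1, 0, 1, 1, 0, 1, 0],  # wrong on samples 0 and 5
}


def accuracy(y_pred, y_ref):
    """Fraction of predictions that match the reference labels."""
    return sum(p == t for p, t in zip(y_pred, y_ref)) / len(y_ref)


def majority_vote(prediction_lists):
    """Hard voting: for each sample, return the label most classifiers chose."""
    return [Counter(column).most_common(1)[0][0] for column in zip(*prediction_lists)]


ensemble = majority_vote(list(preds.values()))

for name, y_pred in preds.items():
    print(name, accuracy(y_pred, y_true))
print("voting ensemble", accuracy(ensemble, y_true))
```

With an odd number of binary classifiers there are no ties, so each sample gets a well-defined vote. In this contrived case every base classifier scores 0.8 while the vote recovers all ten labels, because each individual mistake is outvoted by the other two classifiers.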
