A retrospective study using machine learning to develop predictive model to identify rotavirus-associated acute gastroenteritis in children

人工智能 机器学习 随机森林 支持向量机 朴素贝叶斯分类器 特征选择 逻辑回归 轮状病毒 决策树 医学 精确性和召回率 接收机工作特性 特征(语言学) 腹泻 计算机科学 内科学 语言学 哲学
作者
Sourav Paul,Minhazur Rahman,Anutee Dolley,Kasturi Saikia,Chongtham Shyamsunder Singh,Arifullah Mohammed,Ghazala Muteeb,Rosy Sarmah,Nima D. Namsa
出处
期刊:PeerJ [PeerJ, Inc.]
卷期号:13: e19025-e19025
标识
DOI:10.7717/peerj.19025
摘要

Background Rotavirus is the leading cause of severe dehydrating diarrhea in children under 5 years worldwide. Timely diagnosis is critical, but access to confirmatory testing is limited in hospital settings. Machine learning (ML) models have shown promising potential in supporting symptom-based diagnosis of several diseases in resource-limited settings. Objectives This study aims to develop a machine-learning predictive model integrated with multiple sources of clinical parameters specific to rotavirus infection without relying on laboratory tests. Methods A clinical dataset of 509 children was collected in collaboration with the Regional Institute of Medical Sciences, Imphal, India. The clinical symptoms included diarrhea and its duration, number of stool episodes per day, fever, vomiting and its duration, number of vomiting episodes per day, temperature and dehydration. Correlation analysis is performed to check the feature-feature and feature-outcome collinearity. Feature selection using ANOVA F test is carried out to find the feature importance values and finally obtain the reduced feature subset. Seven supervised learning models were tested and compared viz., support vector machine (SVM), K-nearest neighbor (KNN), naive Bayes (NB), logistic regression (Log_R) , random forest (RF), decision tree (DT), and XGBoost (XGB). A comparison of the performances of the seven models using the classification results obtained. The performance of the models was evaluated based on accuracy, precision, recall, specificity, F1 score, macro F1, F2, and receiver operator characteristic curve. Results The seven ML models were exhaustively experimented on our dataset and compared based on eight evaluation scores which are accuracy, precision, recall, specificity, F1 score, F2 score, macro F1 score, and AUC values computed. We observed that when the seven ML models were applied, RF performed the best with an accuracy of 81.4%, F1 score of 86.9%, macro F1-score of 77.3%, F2 score of 86.5% and area under the curve (AUC) of 89%. Conclusions The machine learning models can contribute to predicting symptom-based diagnosis of rotavirus-associated acute gastroenteritis in children, especially in resource-limited settings. Further validation of the models using a large dataset is needed for predicting pediatric diarrheic populations with optimum sensitivity and specificity.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
庆爷完成签到,获得积分10
刚刚
求助人发布了新的文献求助10
刚刚
XQQDD应助柔弱紊采纳,获得30
刚刚
1秒前
yuyu完成签到,获得积分10
1秒前
1秒前
hhht123完成签到 ,获得积分10
2秒前
2秒前
yrutao完成签到,获得积分10
2秒前
Owen应助1222采纳,获得10
2秒前
tian悦发布了新的文献求助30
2秒前
Lucas应助lin采纳,获得10
3秒前
gu完成签到,获得积分20
3秒前
源于期待发布了新的文献求助10
3秒前
gjt发布了新的文献求助10
4秒前
闲云野鹤完成签到,获得积分10
4秒前
4秒前
小琳完成签到,获得积分10
4秒前
cc完成签到,获得积分10
4秒前
上官若男应助Timo干物类采纳,获得10
4秒前
前进的泡影完成签到,获得积分10
4秒前
5秒前
wxx发布了新的文献求助10
5秒前
yyyyy完成签到,获得积分10
5秒前
发嗲的蓉关注了科研通微信公众号
5秒前
5秒前
都安完成签到,获得积分10
5秒前
打打应助yuyu采纳,获得10
6秒前
无私夜雪发布了新的文献求助10
6秒前
wzait07完成签到,获得积分10
6秒前
moonlin发布了新的文献求助10
6秒前
6秒前
研友_Raven发布了新的文献求助10
6秒前
7秒前
刘可歆完成签到,获得积分10
7秒前
yyc完成签到,获得积分10
7秒前
卡卡完成签到,获得积分10
8秒前
饭后瞌睡完成签到,获得积分10
8秒前
邵洋发布了新的文献求助10
9秒前
呜呼完成签到,获得积分10
9秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
The Organometallic Chemistry of the Transition Metals 800
Chemistry and Physics of Carbon Volume 18 800
The Organometallic Chemistry of the Transition Metals 800
Leading Academic-Practice Partnerships in Nursing and Healthcare: A Paradigm for Change 800
The formation of Australian attitudes towards China, 1918-1941 640
Signals, Systems, and Signal Processing 610
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6437487
求助须知:如何正确求助?哪些是违规求助? 8251936
关于积分的说明 17557101
捐赠科研通 5495747
什么是DOI,文献DOI怎么找? 2898511
邀请新用户注册赠送积分活动 1875316
关于科研通互助平台的介绍 1716303