Detecting Ventricular Beats with Machine Learning Models

心跳 计算机科学 特征选择 人工智能 随机森林 试验装置 模式识别(心理学) 机器学习 特征(语言学) 试验数据 数据集 水准点(测量) 分类器(UML) 数据挖掘 人工神经网络 二元分类 训练集 支持向量机 程序设计语言 地理 哲学 语言学 计算机安全 大地测量学
作者
Stojancho Tudjarski,Aleksandar Stankovski,Marjan Gušev
标识
DOI:10.23919/mipro55190.2022.9803758
摘要

This paper aims at modeling a classifier of Ventricular heartbeats by experimenting with the most advanced classic binary classifiers in different scenarios for feature engineering. Methodology: The results were acquired based on experimenting with XGBoost and Random Forest algorithms, as two of the most advanced classifiers not based on neural networks. Although the annotated ECG data sets contain records with several heartbeat classes, we focus on a model that would distinguish V from others (Non-V heartbeats). Considering that we are dealing with a highly imbalanced data set, we applied the SMOTE algorithm for data enrichment to provide a better-balanced data set for training the model. To acquire better results, we added new calculated features, with and without feature selection. For feature selection, we used the Fisher Selector algorithm. Data: We used MIT-BIH Arrhythmia benchmark database, with train/test split according to the patient-oriented splitting approach that separates the original dataset into two subsets with approximately equal sizes and distribution of heartbeat types. Conclusion: The best results are achieved with XGBoost algorithm with original feature set. We achieved precision of 91.36%, recall of 88.31% and F1 score of 89.81%. Results showed that oversampling does not provide significantly better overall model performance. Still, we would recommend this approach since in practice, when dealing with imbalanced data sets, this leads to more robust models that perform better with data outside the training and test sets, such as when the model is used in production.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
夏雨完成签到,获得积分10
刚刚
1秒前
Gandiva发布了新的文献求助10
2秒前
fys完成签到,获得积分10
2秒前
xh完成签到 ,获得积分10
2秒前
麦子发布了新的文献求助10
2秒前
3秒前
keyanyan完成签到,获得积分10
3秒前
畅畅儿歌完成签到,获得积分10
4秒前
YOUNG-M完成签到,获得积分10
4秒前
hana完成签到,获得积分10
4秒前
Willa应助b不为谁而作的歌采纳,获得10
5秒前
天天快乐应助研友_LmVygn采纳,获得10
6秒前
li完成签到,获得积分10
6秒前
自觉谷南发布了新的文献求助10
6秒前
精明芷巧完成签到 ,获得积分10
6秒前
nihao完成签到,获得积分10
6秒前
godblessyou应助charry采纳,获得10
7秒前
EMP完成签到,获得积分20
7秒前
ll完成签到 ,获得积分10
7秒前
能干戒指完成签到,获得积分10
7秒前
宫野志保发布了新的文献求助30
8秒前
温茹完成签到 ,获得积分10
9秒前
freshabc完成签到,获得积分10
9秒前
华仔应助liu采纳,获得10
9秒前
小花完成签到 ,获得积分10
10秒前
小猫吃鱼完成签到,获得积分10
10秒前
zyy完成签到,获得积分10
10秒前
littlebenk完成签到,获得积分10
11秒前
wanci应助月亮采纳,获得10
11秒前
传奇3应助陌路孤星采纳,获得10
11秒前
微微发布了新的文献求助20
12秒前
咎淇完成签到,获得积分10
13秒前
LW完成签到,获得积分10
14秒前
erkk完成签到,获得积分20
14秒前
14秒前
ding应助Gandiva采纳,获得10
14秒前
yd完成签到,获得积分10
14秒前
不吃香菜完成签到,获得积分10
15秒前
sqxl发布了新的文献求助20
15秒前
高分求助中
Overcoming Stigma and Bias in Obesity Management 800
Malcolm Fraser : a biography 700
Signals, Systems, and Signal Processing 610
Bounds for Statistical Estimation in Semiparametric Models 500
Climate change and sports: Statistics report on climate change and sports 500
Forced degradation and stability indicating LC method for Letrozole: A stress testing guide 500
Ideology and Meaning-Making under the Putin Regime 450
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6474264
求助须知:如何正确求助?哪些是违规求助? 8277071
关于积分的说明 17648633
捐赠科研通 5554880
什么是DOI,文献DOI怎么找? 2909942
邀请新用户注册赠送积分活动 1886699
关于科研通互助平台的介绍 1739255