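Computer science
Discriminative model
Artificial intelligence
Machine learning
Labeled data
Heuristic
Pattern recognition (psychology)
Robustness (evolution)
Data mining
Biochemistry
Chemistry
Gene
Operating system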
Authors
Priyanka Gupta, Saandra Nandakumar, Manik Gupta, Ganapati Panda
Identifier
DOI:10.1016/j.bspc.2023.105540
Abstract
Conventional electrocardiogram (ECG) beat labeling methods are unsuitable for ECG signals obtained from Internet of Things (IoT) wearable devices. These methods rely on manually labeled data captured using multiple leads, whereas most IoT devices produce unlabeled single-lead data. Having ECG data labeled by a subject matter expert (SME) is costly in resources, time, and money. Our research addresses this challenge by proposing an automatic labeling technique for ECG time-series data obtained from a single lead. The technique uses data programming (DP) enabled weak supervised learning (WSL) to label ECG beats automatically. We propose nine novel heuristics-based labeling functions (LFs), apply them to each ECG beat, and then use a generative model (GM) to assign a probabilistic label to each beat under both the intra-patient and inter-patient paradigms on the MIT-BIH and INCART datasets. Further, a discriminative model (DM) is trained on top of the GM to maximize data coverage and ensure robustness. Finally, data augmentation (DA) is used to address the class imbalance problem inherent in ECG data. Our experimental results demonstrate a simpler, faster, and more accurate labeling method: approximately 10^5 ECG beats are labeled in an hour with an accuracy of 92.2% from single-lead data. Compared with human annotators, the time and cost requirements of our proposed labeling method are significantly lower.
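The data programming idea described in the abstract (several heuristic labeling functions vote on each beat, and the votes are combined into a probabilistic label) can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration: the `Beat` fields, the three labeling functions, and their thresholds are hypothetical and are not the nine LFs proposed in the paper, and a simple vote fraction stands in for the paper's generative model.

```python
from collections import Counter
from dataclasses import dataclass

# Label values; ABSTAIN means a labeling function has no opinion.
ABSTAIN, NORMAL, ABNORMAL = -1, 0, 1


@dataclass
class Beat:
    """Hypothetical per-beat features from a single-lead ECG."""
    rr_prev: float    # seconds since the previous R peak (assumed > 0)
    rr_next: float    # seconds until the next R peak (assumed > 0)
    qrs_width: float  # QRS duration in seconds


def lf_premature(beat: Beat) -> int:
    # Illustrative heuristic: a beat arriving much earlier than the
    # following interval is flagged as abnormal.
    return ABNORMAL if beat.rr_prev < 0.8 * beat.rr_next else ABSTAIN


def lf_wide_qrs(beat: Beat) -> int:
    # Illustrative heuristic: a wide QRS complex is flagged as abnormal.
    return ABNORMAL if beat.qrs_width > 0.12 else ABSTAIN


def lf_regular(beat: Beat) -> int:
    # Illustrative heuristic: steady rhythm with a narrow QRS looks normal.
    ratio = beat.rr_prev / beat.rr_next
    if 0.9 <= ratio <= 1.1 and beat.qrs_width <= 0.10:
        return NORMAL
    return ABSTAIN


LABELING_FUNCTIONS = [lf_premature, lf_wide_qrs, lf_regular]


def probabilistic_label(beat: Beat):
    """Combine LF votes into class probabilities.

    Uses the fraction of non-abstaining votes per class, a simple
    stand-in for a learned generative model. Returns None when every
    labeling function abstains (the beat is not covered).
    """
    votes = [lf(beat) for lf in LABELING_FUNCTIONS]
    active = [v for v in votes if v != ABSTAIN]
    if not active:
        return None
    counts = Counter(active)
    return {
        NORMAL: counts[NORMAL] / len(active),
        ABNORMAL: counts[ABNORMAL] / len(active),
    }
```

Beats for which every labeling function abstains receive no label in this sketch; that coverage gap is the kind of case the abstract says the discriminative model is trained to handle.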