计算机科学
人工智能
编码器
机器学习
特征学习
任务(项目管理)
代表(政治)
情态动词
特征(语言学)
领域(数学分析)
模式识别(心理学)
语音识别
工程类
数学分析
哲学
操作系统
政治
化学
高分子化学
法学
系统工程
语言学
数学
政治学
作者
Che Liu,Zhongwei Wan,Sibo Cheng,Mi Zhang,Rossella Arcucci
标识
DOI:10.1109/icassp48485.2024.10446742
摘要
In the domain of cardiovascular healthcare, the Electrocardiogram (ECG) serves as a critical, non-invasive diagnostic tool. Although recent strides in self-supervised learning (SSL) have been promising for ECG representation learning, these techniques often require annotated samples and struggle with classes not present in the fine-tuning stages. To address these limitations, we introduce ECG-Text Pre-training (ETP), an innovative framework designed to learn cross-modal representations that link ECG signals with textual reports. For the first time, this framework leverages the zero-shot classification task in the ECG domain. ETP employs an ECG encoder along with a pre-trained language model to align ECG signals with their corresponding textual reports. The proposed framework excels in both linear evaluation and zero-shot classification tasks, as demonstrated on the PTB-XL and CPSC2018 datasets, showcasing its ability for robust and generalizable cross-modal ECG feature learning.
科研通智能强力驱动
Strongly Powered by AbleSci AI