心音图
联营
计算机科学
特征(语言学)
模式识别(心理学)
人工智能
卷积(计算机科学)
信号(编程语言)
比例(比率)
块(置换群论)
棱锥(几何)
特征提取
卷积神经网络
语音识别
人工神经网络
数学
程序设计语言
哲学
语言学
物理
几何学
量子力学
作者
Junbin Zang,Cheng Lian,Bingrong Xu,Zhidong Zhang,Yixin Su,Xue Chen
标识
DOI:10.1016/j.bspc.2023.104934
摘要
Phonocardiogram (PCG) signals reflect the mechanical activity of the heart. With the automatic diagnosis of PCG signals, cardiovascular diseases can be effectively detected. This paper presents an effective PCG signal classification model, called the attentional multi-scale temporal network (AmtNet), which is constructed by one-dimensional convolution and pooling as the basic components. AmtNet can directly classify raw PCG signals without complicated feature engineering processes. Specifically, we design a multi-scale feature extraction architecture using dense connections and three different dilated convolutional paths. We adopt a one-dimensional convolutional block attention module (CBAM) to adaptively refine the intermediate feature map and design a temporal pyramid pooling layer to incorporate the multi-scale temporal information. Moreover, we further study the effectiveness of several time series data augmentation and transfer learning techniques for improving the performance of AmtNet. Extensive experiments on four public PCG datasets indicate that the proposed AmtNet can achieve competitive results for PCG signal classification.
科研通智能强力驱动
Strongly Powered by AbleSci AI