Keywords
Convolutional neural network; computer science; artificial intelligence; wavelet; pattern recognition; benchmark; feature; block; frequency domain; wavelet transform; discrete wavelet transform; noise; image classification; feature extraction; computer vision
Authors
Xiangyu Zhao,Peng Huang,Xiangbo Shu
Identifier
DOI:10.1007/s00530-022-00889-8
Abstract
Feature learning methods based on convolutional neural networks (CNNs) have produced tremendous achievements in image classification tasks. However, inherent noise and other factors may weaken the effectiveness of the convolutional feature statistics. In this paper, we investigate the Discrete Wavelet Transform (DWT) in the frequency domain and design a new Wavelet-Attention (WA) block that applies attention only in the high-frequency domain. Based on this, we propose a Wavelet-Attention convolutional neural network (WA-CNN) for image classification. Specifically, WA-CNN decomposes the feature maps into low-frequency and high-frequency components, which store the basic object structures and the detailed information together with noise, respectively. The WA block is then leveraged to capture the detailed information in the high-frequency domain with different attention factors while preserving the basic object structures in the low-frequency domain. Experimental results on the CIFAR-10 and CIFAR-100 datasets show that the proposed WA-CNN achieves significant improvements in classification accuracy compared to other related networks. Specifically, with MobileNetV2 backbones, WA-CNN achieves a 1.26% Top-1 accuracy improvement on the CIFAR-10 benchmark and a 1.54% Top-1 accuracy improvement on the CIFAR-100 benchmark.
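To make the decomposition-plus-attention idea concrete, below is a minimal PyTorch sketch of how a feature map can be split by a single-level Haar DWT, with an attention factor computed only from the high-frequency sub-bands while the low-frequency component passes through unchanged. This is an illustrative assumption, not the authors' released implementation: the module name WaveletAttentionBlock, the Haar wavelet choice, and the 1x1-conv-plus-sigmoid gate are all hypothetical stand-ins for the attention form described in the abstract.

```python
import torch
import torch.nn as nn


def haar_dwt2d(x):
    """Single-level 2-D Haar DWT on an NCHW tensor.

    Returns the low-frequency approximation (LL) and the three
    high-frequency detail sub-bands (LH, HL, HH), each at half resolution.
    """
    x00 = x[:, :, 0::2, 0::2]
    x01 = x[:, :, 0::2, 1::2]
    x10 = x[:, :, 1::2, 0::2]
    x11 = x[:, :, 1::2, 1::2]
    ll = (x00 + x01 + x10 + x11) / 2
    lh = (-x00 - x01 + x10 + x11) / 2
    hl = (-x00 + x01 - x10 + x11) / 2
    hh = (x00 - x01 - x10 + x11) / 2
    return ll, lh, hl, hh


class WaveletAttentionBlock(nn.Module):
    """Hypothetical sketch of a Wavelet-Attention block.

    Attention weights are learned only from the high-frequency
    sub-bands; the low-frequency component is kept as-is so the
    basic object structure is preserved.
    """

    def __init__(self, channels):
        super().__init__()
        # Attention factor from the concatenated detail sub-bands
        # (assumption: a 1x1 conv + sigmoid gate; the paper may use
        # a different attention formulation).
        self.gate = nn.Sequential(
            nn.Conv2d(3 * channels, channels, kernel_size=1),
            nn.Sigmoid(),
        )

    def forward(self, x):
        ll, lh, hl, hh = haar_dwt2d(x)
        detail = torch.cat([lh, hl, hh], dim=1)
        att = self.gate(detail)          # per-pixel attention factors
        high = (lh + hl + hh) * att      # re-weighted detail information
        return ll + high                 # low-frequency structure preserved


if __name__ == "__main__":
    block = WaveletAttentionBlock(channels=64)
    feat = torch.randn(2, 64, 32, 32)
    out = block(feat)
    print(out.shape)  # torch.Size([2, 64, 16, 16]); halves resolution like a stride-2 stage
```

Because the DWT halves the spatial resolution, such a block could plausibly replace a stride-2 downsampling stage in a backbone such as MobileNetV2; the exact placement in WA-CNN should be taken from the paper itself.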