失败
计算机科学
架空(工程)
特征(语言学)
卷积神经网络
人工智能
成对比较
频道(广播)
光学(聚焦)
人工神经网络
目标检测
模式识别(心理学)
机器学习
依赖关系(UML)
计算机工程
并行计算
计算机网络
哲学
物理
光学
操作系统
语言学
作者
Qinglong Zhang,Yu-Bin Yang
出处
期刊:International Conference on Acoustics, Speech, and Signal Processing
日期:2021-06-06
被引量:421
标识
DOI:10.1109/icassp39728.2021.9414568
摘要
Attention mechanisms, which enable a neural network to accurately focus on all the relevant elements of the input, have become an essential component to improve the performance of deep neural networks. There are mainly two attention mechanisms widely used in computer vision studies, spatial attention and channel attention, which aim to capture the pixel-level pairwise relationship and channel dependency, respectively. Although fusing them together may achieve better performance than their individual implementations, it will inevitably increase the computational overhead. In this paper, we propose an efficient Shuffle Attention (SA) module to address this issue, which adopts Shuffle Units to combine two types of attention mechanisms effectively. Specifically, SA first groups channel dimensions into multiple sub-features before processing them in parallel. Then, for each sub-feature, SA utilizes a Shuffle Unit to depict feature dependencies in both spatial and channel dimensions. After that, all sub-features are aggregated and a "channel shuffle" operator is adopted to enable information communication between different sub-features. The proposed SA module is efficient yet effective, e.g., the parameters and computations of SA against the backbone ResNet50 are 300 vs. 25.56M and 2.76e-3 GFLOPs vs. 4.12 GFLOPs, respectively, and the performance boost is more than 1.34% in terms of Top-1 accuracy. Extensive experimental results on common-used benchmarks, including ImageNet-1k for classification, MS COCO for object detection, and instance segmentation, demonstrate that the proposed SA outperforms the current SOTA methods significantly by achieving higher accuracy while having lower model complexity.
科研通智能强力驱动
Strongly Powered by AbleSci AI