卷积(计算机科学)
块(置换群论)
卷积神经网络
编码(集合论)
编码(内存)
光学(聚焦)
构造(python库)
计算机科学
频道(广播)
特征(语言学)
解码方法
人工智能
理论计算机科学
模式识别(心理学)
计算机工程
算法
人工神经网络
程序设计语言
电信
哲学
物理
光学
集合(抽象数据类型)
语言学
数学
几何学
作者
Jie Hu,Li Shen,Gang Sun
出处
期刊:arXiv: Computer Vision and Pattern Recognition
日期:2018-06-01
被引量:18131
标识
DOI:10.1109/cvpr.2018.00745
摘要
Convolutional neural networks are built upon the convolution operation, which extracts informative features by fusing spatial and channel-wise information together within local receptive fields. In order to boost the representational power of a network, several recent approaches have shown the benefit of enhancing spatial encoding. In this work, we focus on the channel relationship and propose a novel architectural unit, which we term the "Squeeze-and-Excitation" (SE) block, that adaptively recalibrates channel-wise feature responses by explicitly modelling interdependencies between channels. We demonstrate that by stacking these blocks together, we can construct SENet architectures that generalise extremely well across challenging datasets. Crucially, we find that SE blocks produce significant performance improvements for existing state-of-the-art deep architectures at minimal additional computational cost. SENets formed the foundation of our ILSVRC 2017 classification submission which won first place and significantly reduced the top-5 error to 2.251%, achieving a ~25% relative improvement over the winning entry of 2016. Code and models are available at https://github.com/hujie-frank/SENet.
科研通智能强力驱动
Strongly Powered by AbleSci AI