膨胀(度量空间)
卷积(计算机科学)
计算机科学
人工智能
分割
算法
模式识别(心理学)
重叠-添加方法
目标检测
图像分割
数学优化
数学
傅里叶变换
数学分析
组合数学
分数阶傅立叶变换
傅里叶分析
人工神经网络
作者
Jie Liu,Chuming Li,Feng Liang,Lin Chen,Ming Sun,Junjie Yan,Wanli Ouyang,Dong Xu
标识
DOI:10.1109/cvpr46437.2021.01132
摘要
As a variant of standard convolution, a dilated convolution can control effective receptive fields and handle large scale variance of objects without introducing additional computational costs. To fully explore the potential of dilated convolution, we proposed a new type of dilated convolution (referred to as inception convolution), where the convolution operations have independent dilation patterns among different axes, channels and layers. To develop a practical method for learning complex inception convolution based on the data, a simple but effective search algorithm, referred to as efficient dilation optimization (EDO), is developed. Based on statistical optimization, the EDO method operates in a low-cost manner and is extremely fast when it is applied on large scale datasets. Empirical results validate that our method achieves consistent performance gains for image recognition, object detection, instance segmentation, human detection, and human pose estimation. For instance, by simply replacing the 3 × 3 standard convolution in the ResNet-50 backbone with inception convolution, we significantly improve the AP of Faster R-CNN from 36.4% to 39.2% on MS COCO.
科研通智能强力驱动
Strongly Powered by AbleSci AI