计算机科学
特征(语言学)
增采样
代表(政治)
水准点(测量)
边距(机器学习)
模式识别(心理学)
块(置换群论)
背景(考古学)
网(多面体)
计算机视觉
人工智能
对象(语法)
目标检测
图像(数学)
机器学习
数学
地图学
地理
哲学
法学
考古
政治
语言学
政治学
几何学
作者
Lisha Cui,Pei Lv,Xiaoheng Jiang,Zhimin Gao,Bing Zhou,Luming Zhang,Ling Shao,Mingliang Xu
标识
DOI:10.1109/tcyb.2020.3004636
摘要
State-of-the-art object detectors usually progressively downsample the input image until it is represented by small feature maps, which loses the spatial information and compromises the representation of small objects. In this article, we propose a context-aware block net (CAB Net) to improve small object detection by building high-resolution and strong semantic feature maps. To internally enhance the representation capacity of feature maps with high spatial resolution, we delicately design the context-aware block (CAB). CAB exploits pyramidal dilated convolutions to incorporate multilevel contextual information without losing the original resolution of feature maps. Then, we assemble CAB to the end of the truncated backbone network (e.g., VGG16) with a relatively small downsampling factor (e.g., 8) and cast off all following layers. CAB Net can capture both basic visual patterns as well as semantical information of small objects, thus improving the performance of small object detection. Experiments conducted on the benchmark Tsinghua-Tencent 100K and the Airport dataset show that CAB Net outperforms other top-performing detectors by a large margin while keeping real-time speed, which demonstrates the effectiveness of CAB Net for small object detection.
科研通智能强力驱动
Strongly Powered by AbleSci AI