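Computer science
Artificial intelligence
Face (sociological concept)
Feature (linguistics)
Scale (ratio)
Pattern recognition (psychology)
Detector
Layer (electronics)
Image resolution
Training set
Face detection
Range (aeronautics)
Sampling (signal processing)
Feature extraction
Facial recognition system
Computer vision
Geography
Engineering
Cartography
Philosophy
Sociology
Aerospace engineering
Telecommunications
Organic chemistry
Chemistry
Linguistics
Social science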
Authors
Ming Xiang, Fangyun Wei, Ting Zhang, Chen Dong, Fang Wen
Identifier
DOI:10.1109/cvpr.2019.00356
Abstract
Detectors based on deep learning tend to detect multi-scale faces on a single input image for efficiency. Recent works, such as FPN and SSD, generally use feature maps from multiple layers with different spatial resolutions to detect objects at different scales, e.g., high-resolution feature maps for small objects. However, we find that such multi-layer prediction is not necessary: faces at all scales can be detected well with features from a single layer of the network. In this paper, we carefully examine the factors affecting face detection across a large range of scales, and conclude that the balance of training samples, both positive and negative, across scales is the key. We propose a group sampling method that divides the anchors into several groups according to scale and ensures that the number of samples for each group is the same during training. Our approach, using only the last layer of FPN as features, is able to advance the state of the art. Comprehensive analysis and extensive experiments have been conducted to show the effectiveness of the proposed method. Our approach, evaluated on face detection benchmarks including the FDDB and WIDER FACE datasets, achieves state-of-the-art results without bells and whistles.
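The group sampling idea in the abstract (bucket anchors by scale, then draw the same number of training samples from every bucket) can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `group_sample`, the `(scale, label)` anchor representation, and the `pos_fraction` parameter with its default of 0.25 are all assumptions made for illustration, since the abstract does not specify a positive-to-negative ratio.

```python
import random
from collections import defaultdict


def group_sample(anchors, per_group, pos_fraction=0.25, seed=0):
    """Draw the same number of anchors from every scale group.

    anchors: list of (scale, label) pairs; label is 1 (face) or 0 (background).
    per_group: requested number of samples per scale group.
    pos_fraction: hypothetical target share of positives (not from the abstract).
    Returns {scale: [anchor indices]} with equally sized index lists.
    """
    rng = random.Random(seed)
    pos, neg = defaultdict(list), defaultdict(list)
    for idx, (scale, label) in enumerate(anchors):
        (pos if label == 1 else neg)[scale].append(idx)

    scales = sorted(set(pos) | set(neg))
    # Shrink the budget so that even the smallest group can fill it.
    budget = min([per_group] + [len(pos[s]) + len(neg[s]) for s in scales])

    picked = {}
    for s in scales:
        n_pos = min(len(pos[s]), int(budget * pos_fraction))
        n_neg = min(len(neg[s]), budget - n_pos)
        # If negatives ran short, top up with positives to keep groups equal.
        n_pos = min(len(pos[s]), budget - n_neg)
        picked[s] = rng.sample(pos[s], n_pos) + rng.sample(neg[s], n_neg)
    return picked


if __name__ == "__main__":
    # Toy data: many small-scale anchors, few large-scale ones.
    toy = ([(16, 1)] * 5 + [(16, 0)] * 95
           + [(64, 1)] * 10 + [(64, 0)] * 10
           + [(256, 1)] * 2 + [(256, 0)] * 6)
    for scale, idxs in group_sample(toy, per_group=8).items():
        print(scale, len(idxs))
```

Without this balancing step, the scale with the most anchors (here 16) would dominate each training batch; with it, every scale contributes the same number of samples regardless of how many anchors it has.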