计算机科学
可扩展性
棱锥(几何)
特征(语言学)
编码(集合论)
目标检测
树(集合论)
探测器
对象(语法)
失败
计算机工程
人工智能
模式识别(心理学)
并行计算
数据库
程序设计语言
数学分析
集合(抽象数据类型)
哲学
物理
光学
电信
语言学
数学
作者
Mingxing Tan,Ruoming Pang,Quoc V. Le
标识
DOI:10.1109/cvpr42600.2020.01079
摘要
Model efficiency has become increasingly important in computer vision. In this paper, we systematically study neural network architecture design choices for object detection and propose several key optimizations to improve efficiency. First, we propose a weighted bi-directional feature pyramid network (BiFPN), which allows easy and fast multi-scale feature fusion; Second, we propose a compound scaling method that uniformly scales the resolution, depth, and width for all backbone, feature network, and box/class prediction networks at the same time. Based on these optimizations and EfficientNet backbones, we have developed a new family of object detectors, called EfficientDet, which consistently achieve much better efficiency than prior art across a wide spectrum of resource constraints. In particular, with single-model and single-scale, our EfficientDet-D7 achieves state-of-the-art 52.2 AP on COCO test-dev with 52M parameters and 325B FLOPs1, being 4x - 9x smaller and using 13x - 42x fewer FLOPs than previous detector. Code is available at https://github.com/google/ automl/tree/master/efficientdet.
科研通智能强力驱动
Strongly Powered by AbleSci AI