计算机科学
探测器
目标检测
对象(语法)
编码(集合论)
建筑
人工智能
源代码
航程(航空)
国家(计算机科学)
计算机视觉
模式识别(心理学)
集合(抽象数据类型)
操作系统
程序设计语言
工程类
艺术
电信
视觉艺术
航空航天工程
作者
Chien-Yao Wang,Alexey Bochkovskiy,Hong-Yuan Mark Liao
标识
DOI:10.1109/cvpr52729.2023.00721
摘要
Real-time object detection is one of the most important research topics in computer vision. As new approaches regarding architecture optimization and training optimization are continually being developed, we have found two research topics that have spawned when dealing with these latest state-of-the-art methods. To address the topics, we propose a trainable bag-of-freebies oriented solution. We combine the flexible and efficient training tools with the proposed architecture and the compound scaling method. YOLOv7 surpasses all known object detectors in both speed and accuracy in the range from 5 FPS to 120 FPS and has the highest accuracy 56.8% AP among all known real-time object detectors with 30 FPS or higher on GPU V100. Source code is released in https://github.com/WongKinYiu/yolov7.
科研通智能强力驱动
Strongly Powered by AbleSci AI