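Object detection
Computer science
Convergence (economics)
Task (project management)
Artificial intelligence
Object (grammar)
Channel (broadcasting)
Backbone network
Network model
Position (finance)
Pattern recognition (psychology)
Limit (mathematics)
Artificial neural network
Image (mathematics)
Machine learning
Algorithm
Mathematics
Engineering
Telecommunications
Mathematical analysis
Systems engineering
Finance
Economics
Economic growth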
Authors
Xingzhu Liang,Wei Cheng,Chunjiong Zhang,Lixin Wang,Xinyun Yan,Qing Chen
Source
Journal: IEEE Transactions on Consumer Electronics
[Institute of Electrical and Electronics Engineers]
Date: 2023-05-25
Volume (issue): 69 (4): 775-785
Citations: 5
Identifier
DOI:10.1109/tce.2023.3278264
Abstract
Object detection comprises three subtasks: predicting target position, classification, and confidence. Mainstream object detection models pursue refinement of their internal structure, and the subtasks share almost the same structure, which makes it a task-coupled structure. A task-coupled structure reduces the number of training parameters, but the network structure cannot be tuned for each task separately, which can limit model performance. We designed a task-decoupled object detection network (YOLOD) based on YOLOv5, in which the tasks are decoupled immediately after the backbone network. By observing the loss convergence of each subtask, three network structures are designed separately, and the branch size is controlled so that the model has fewer training parameters. At the same time, some experimental adjustments were made to YOLOD to accelerate the convergence of the model. In addition, we add image contour information to the original three-channel image to assist model training and improve detection accuracy. The experiments demonstrate that the modified model is smaller in size and that the accuracy improvement is largest on the small-scale model. Without introducing any attention-based modules, YOLOD-S achieves a mAP improvement of 1.1% on the MS COCO dataset and 2.29% on the VOC dataset, and the larger model YOLOD-L achieves an accuracy of 48.8% on the COCO dataset.
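The abstract mentions adding image contour information to the original three-channel image but does not say how the contour is computed. The sketch below is only an illustration of the general idea under stated assumptions: it uses a simple central-difference gradient magnitude as a stand-in for the contour map, averages the channels to get grayscale, and appends the result as a fourth channel. The function name add_contour_channel and these choices are hypothetical and are not taken from the paper.

```python
def add_contour_channel(image):
    """Append an edge-strength channel to an RGB image.

    image: H x W nested list of (r, g, b) tuples, values in 0..255.
    Returns an H x W nested list of (r, g, b, edge) tuples.
    Assumption: edge strength is a central-difference gradient
    magnitude, not necessarily the contour operator used in YOLOD.
    """
    height = len(image)
    width = len(image[0])
    # Grayscale by plain channel average (an illustrative choice).
    gray = [[sum(px) / 3.0 for px in row] for row in image]

    def at(y, x):
        # Clamp coordinates so border pixels reuse their nearest neighbour.
        y = min(max(y, 0), height - 1)
        x = min(max(x, 0), width - 1)
        return gray[y][x]

    out = []
    for y in range(height):
        row = []
        for x in range(width):
            gx = at(y, x + 1) - at(y, x - 1)  # horizontal difference
            gy = at(y + 1, x) - at(y - 1, x)  # vertical difference
            edge = min(255, int(round((gx * gx + gy * gy) ** 0.5)))
            r, g, b = image[y][x]
            row.append((r, g, b, edge))
        out.append(row)
    return out
```

A detector consuming such input would need its first convolution to accept four input channels instead of three; how YOLOD handles this is not stated in the abstract.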