计算机科学
帕斯卡(单位)
骨干网
特征(语言学)
人工智能
深度学习
融合机制
目标检测
光学(聚焦)
融合
模式识别(心理学)
数据挖掘
计算机网络
物理
哲学
语言学
脂质双层融合
光学
程序设计语言
作者
Chao Jiang,Hao Zhang,Yunliang Yue,Xuelong Hu
标识
DOI:10.1109/itoec53115.2022.9734536
摘要
To address the problem that target detection models such as FASTER RCNN, YOLO and SSD focus too much on the depth of the network and neglect to make full use of the deep semantic feature information of the image, this paper proposes a new network: AM-YOLO. The network makes full use of contextual relationship between shallow and deep layers to achieve multi-feature fusion of the target. In AM-YOLO, SE blocks are firstly added in the backbone network to differentiate the channel importance of feature maps. Then a new path aggregation network is proposed to achieve the full fusion of shallow and deep features. This paper uses YOLOV4 as the baseline, PASCAL VOC07+12 for dataset and the experimental results show that on the 3060 GPU, the map of AM-YOLO is improved by 2.86% compared with YOLOV4 model, which validates the comprehensive performance of AM-YOLO.
科研通智能强力驱动
Strongly Powered by AbleSci AI