残余物
计算机科学
计算机视觉
人工智能
变压器
数据挖掘
工程类
电压
算法
电气工程
作者
Yifei Chen,Ye Jin,Wei Yang,Weixin Hu,Zhihui Zhang,Chongzhou Wang,Xuechen Jiao
标识
DOI:10.1117/1.jei.34.1.013003
摘要
We propose a learned inverted residual cascaded group small target detection transformer (LIS-DETR) model, a novel approach for small object detection in autonomous driving. This model has a unique backbone network, the basic inverted residual cascaded group mobile block, which enhances feature representation and reduces computational redundancy. A dedicated small detection layer is integrated to improve small object detection specifically. In addition, an adaptive learned positional encoding transformer layer is incorporated to strengthen global contextual relationships, and the designed inner-SIoU loss function further accelerates convergence speed. Experimental results show a 3.1% increase in mAP50 accuracy on VisDrone datasets and a 1.9% improvement on processed SODA10m datasets compared with baseline methods. These advances demonstrate the LIS-DETR model's strong generalization ability and the significant potential to enhance the efficacy of autonomous driving systems.
科研通智能强力驱动
Strongly Powered by AbleSci AI