计算机科学
联营
人工智能
目标检测
帕斯卡(单位)
卷积(计算机科学)
棱锥(几何)
核(代数)
模式识别(心理学)
特征(语言学)
推论
计算机视觉
数学
人工神经网络
组合数学
哲学
语言学
程序设计语言
几何学
作者
Xu Guo,Ming Ma,Jiaqiang Zhang,Shaojie Li
标识
DOI:10.1109/icassp49357.2023.10096629
摘要
In the coal transportation scene, the object detection model proposed for the driver behavior detection task generally has the problems of inaccurate positioning and difficult detection of small objects, we propose a new model YOLOX-B, which introduces a serialized atrous spatial pyramid pooling structure (S-ASPP), obtains different sizes of receptive field information through serialized atrous convolution, solves the problem of information loss in max-pooling, and maximizes the efficiency of atrous convolution. Meanwhile, by introducing a lightweight feature reorganization module based on transposed convolution, adaptively predicting the up-sampling kernel weight, the model can better complete pixel recovery in a weighted way and improve the detection accuracy of small objects. The experimental results on the publicly available PASCAL VOC 2012 dataset and the self-built driver behavior dataset demonstrate that the YOLOX-B maintains the same inference speed as YOLOX-S, and its mean Average Precisions(mAPs) are improved by 4.4% and 0.8%, respectively.
科研通智能强力驱动
Strongly Powered by AbleSci AI