计算机科学
人工智能
计算机视觉
目标检测
块(置换群论)
架空(工程)
模式识别(心理学)
数学
几何学
操作系统
作者
Qiuhao Zhang,Jiaming Tang,Haoze Zheng,Chunyu Lin
出处
期刊:Displays
[Elsevier]
日期:2022-12-01
卷期号:75: 102328-102328
被引量:6
标识
DOI:10.1016/j.displa.2022.102328
摘要
Object detection technology for images generated by optical sensors is of great significance in areas such as national defense security, disaster prediction, and smart city construction. Aiming at the problems of small target size, arbitrary target direction, and complex background in aerial image object detection using optical sensors. By using the images captured by a wild range of optical sensors such as CCD(Charge Coupled Device) and SAR(Synthetic Aperture Radar), we propose a more efficient rotating frame object detection algorithm that introduces the MCAB (Multi-branch Convolutional Attention Block) that can process these same types of images taken by the aerial platform using light sensors. First, we construct the module by adding identity, residual branch, and CBAM (Convolutional Block Attention Module) structure to the traditional convolution layer and the activation function to substitute the simple conv module in the network. Secondly, we introduce the Transformer layer at the end of the backbone network to enhance the global perception of the model with low overhead, realize the relationship is modeling between the target and the scene content, and reduce the amount of calculation. We improve the structure of the neck layer by BiFPN (Bidirectional Feature Pyramid Network) to speed up network operation, and CBAM is added after C3 to highlight important characteristic information. Our improved YOLOv5 algorithm is tested on the self-made rotating small target aerial data set. Compared with the original algorithm, our model's mean Average Precision (mAP) is improved by 1.8 percentage points, and the accuracy and recall are also enhanced by 0.46 and 0.34 percentage points.
科研通智能强力驱动
Strongly Powered by AbleSci AI