棱锥(几何)
计算机科学
对象(语法)
目标检测
人工智能
计算机视觉
模式识别(心理学)
数学
几何学
作者
Xiangyan Tang,Wentian Xu,Keqiu Li,Mengxue Han,Zhizhong Ma,Ruili Wang
标识
DOI:10.1016/j.ins.2024.120576
摘要
Object detection is a challenging task that requires a trade-off between accuracy and efficiency. Previous approaches have focused mainly on optimizing one aspect at the expense of the other, making them unsuitable for resource-constrained devices. To address this issue, we propose a new object detection network architecture, the Pyramid Integration and Attention Enhanced Network (PIAENet). PIAENet is a lightweight architecture that can achieve high accuracy and efficiency. We utilize a lightweight EfficientNet-B2 backbone for feature extraction to maintain accuracy while reducing computational overhead. The core components of PIAENet, the Pyramid Integration Module (PIM) and the Attention Enhanced Module (AEM), work together to improve the performance of object detection. PIM fuses multi-scale features using multiple branches to enhance the receptive field of the model, while AEM strengthens the fusion of features using two attention mechanisms to suppress the influence of irrelevant information. Our proposed method has been evaluated on the PASCAL VOC and KITTI datasets. The results have shown our method outperforms most of the existing state-of-the-art methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI