计算机科学
稳健性(进化)
人工智能
特征提取
棱锥(几何)
特征(语言学)
模式识别(心理学)
频道(广播)
计算机视觉
数据挖掘
机器学习
计算机网络
生物化学
化学
物理
语言学
哲学
光学
基因
作者
Guanghui Gao,Yining Guo,Lumei Zhou,Li Li,Gang Shi
出处
期刊:PLOS ONE
[Public Library of Science]
日期:2024-05-20
卷期号:19 (5): e0300017-e0300017
标识
DOI:10.1371/journal.pone.0300017
摘要
With the increasing applications of traffic scene image classification in intelligent transportation systems, there is a growing demand for improved accuracy and robustness in this classification task. However, due to weather conditions, time, lighting variations, and annotation costs, traditional deep learning methods still have limitations in extracting complex traffic scene features and achieving higher recognition accuracy. The previous classification methods for traffic scene images had gaps in multi-scale feature extraction and the combination of frequency domain, spatial, and channel attention. To address these issues, this paper proposes a multi-scale and multi-attention model based on Res2Net. Our proposed framework introduces an Adaptive Feature Refinement Pyramid Module (AFRPM) to enhance multi-scale feature extraction, thus improving the accuracy of traffic scene image classification. Additionally, we integrate frequency domain and spatial-channel attention mechanisms to develop recognition capabilities for complex backgrounds, objects of different scales, and local details in traffic scene images. The paper conducts the task of classifying traffic scene images using the Traffic-Net dataset. The experimental results demonstrate that our model achieves an accuracy of 96.88% on this dataset, which is an improvement of approximately 2% compared to the baseline Res2Net network. Furthermore, we validate the effectiveness of the proposed modules through ablation experiments.
科研通智能强力驱动
Strongly Powered by AbleSci AI