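Obstacle
Pyramid (geometry)
Computer science
Artificial intelligence
Fuse (electrical)
Feature (linguistics)
Computer vision
Pedestrian detection
Backbone network
Deep learning
Pattern recognition (psychology)
Feature extraction
Key (lock)
Pedestrian
Engineering
Mathematics
Linguistics
Philosophy
Geometry
Electrical engineering
Computer security
Political science
Transportation engineering
Law
Computer network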
Authors
Zhenwei Li,Wei Zhang,Xiaoli Yang
Source
Journal: Electronics
[MDPI AG]
Date: 2023-05-14
Volume (issue): 12 (10): 2228-2228
Citations: 1
Identifier
DOI:10.3390/electronics12102228
Abstract
Timely detection of dynamic and static obstacles and accurate identification of traffic lights through image processing are key technologies for guidance robots and are necessary for helping blind people travel safely. Because real-time road conditions are complex, current obstacle and traffic light detection methods often suffer from missed and false detections. This paper proposes an improved deep learning model based on YOLOv5 to address these problems and to recognize, more accurately and quickly, the obstacles and traffic lights that blind people may encounter. In this model, a coordinate attention layer is added to the YOLOv5 backbone network to improve its ability to extract effective features. The feature pyramid network in YOLOv5 is then replaced with a weighted bidirectional feature pyramid structure, which fuses the extracted feature maps of different sizes to obtain more feature information. Finally, an SIoU loss function is introduced so that the angle between the predicted and ground-truth boxes is taken into account in bounding-box regression. The proposed model's detection performance for pedestrians, vehicles, and traffic lights under different conditions is tested and evaluated on the BDD100K dataset. The results show that the improved model achieves higher mean average precision and better detection ability, especially for small targets.
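The abstract does not say how the weighted bidirectional feature pyramid combines its inputs. As a rough illustration only, the sketch below shows the "fast normalized fusion" rule commonly associated with BiFPN-style networks: each input map gets a learnable scalar weight, the weights are clipped to be non-negative, and they are normalized by their sum. The function name, the plain nested-list representation, and the `eps` value are assumptions made for this example, not details taken from the paper. In a real network the maps would first be resized to a common resolution.

```python
def fast_normalized_fusion(feature_maps, weights, eps=1e-4):
    """Fuse same-shaped 2D feature maps using non-negative, normalized weights.

    Hypothetical helper for illustration: feature_maps is a list of 2D lists
    with identical shapes, and weights holds one scalar per map.
    """
    if len(feature_maps) != len(weights):
        raise ValueError("need exactly one weight per feature map")

    # ReLU-style clipping keeps every weight non-negative.
    clipped = [max(0.0, w) for w in weights]
    # eps avoids division by zero when all weights are clipped to 0.
    total = sum(clipped) + eps

    rows, cols = len(feature_maps[0]), len(feature_maps[0][0])
    fused = [[0.0] * cols for _ in range(rows)]
    for fmap, w in zip(feature_maps, clipped):
        scale = w / total
        for r in range(rows):
            for c in range(cols):
                fused[r][c] += scale * fmap[r][c]
    return fused


if __name__ == "__main__":
    shallow = [[1.0, 2.0], [3.0, 4.0]]   # stand-in for a high-resolution map
    deep = [[3.0, 4.0], [5.0, 6.0]]      # stand-in for an upsampled deep map
    print(fast_normalized_fusion([shallow, deep], [1.0, 1.0]))
```

With equal weights the result is close to the element-wise average of the inputs. A map whose weight is driven to zero or below during training simply stops contributing, which is how this scheme lets the network learn how much each scale matters.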