失败
帕斯卡(单位)
目标检测
门
计算机科学
变压器
人工智能
计算机视觉
实时计算
工程类
模式识别(心理学)
电气工程
并行计算
操作系统
电压
程序设计语言
作者
Yuan Dai,Weiming Liu,Heng Wang,Wei Xie,Kejun Long
出处
期刊:IEEE Transactions on Instrumentation and Measurement
[Institute of Electrical and Electronics Engineers]
日期:2022-01-01
卷期号:71: 1-14
被引量:26
标识
DOI:10.1109/tim.2022.3219468
摘要
The automatic detection of foreign objects between platform screen doors (PSDs) and metro train doors significantly affects personnel and property safety and maintains the train’s normal operation. However, some existing works only determine the presence of foreign objects but cannot indicate their categories. Besides, although deep-learning-based object detection algorithms can indicate the presence and categories of foreign objects, most of them only harness the information in region proposals, ignoring global contextual information. Furthermore, their performance comes at the considerable cost of computational complexity, and leading cannot be well deployed in the metro environment. To address these issues and better implement foreign object detection (FOD), we present You Only Look Once-Transformer (YOLO-Former), a simple but efficient model. YOLO-Former is accomplished based on YOLOv5 through the following procedure. First, the vision transformer (ViT) is introduced for dynamic attention and global modeling, thereby solving the problem that the original YOLOv5 only utilizes information in region proposals and has insufficient ability to capture global information. Second, the convolutional block attention module (CBAM) and Stem module are used to improve feature expression ability further and reduce floating point operations (FLOPs). Finally, we design various variants with different widths and depths to meet every need. Experiments on the foreign object detection dataset (FODD) and PASCAL VOC dataset demonstrate that YOLO-Former-x consistently outperforms other state-of-the-arts with significant margins (0.5 to 11.3 mean average precision, mAP, on FODD and 0.6 to 13.6 on PASCAL VOC dataset). Last but not least, YOLO-Former-x maintains real-time processing speed (27.32 and 28.17 frame per second, FPS, on TITAN Xp).
科研通智能强力驱动
Strongly Powered by AbleSci AI