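Computer science
Segmentation
Pyramid (geometry)
Artificial intelligence
Encoder
Pooling
Convolution (computer science)
Pattern recognition (psychology)
Intersection (aeronautics)
Image segmentation
Computer vision
Artificial neural network
Operating system
Physics
Engineering
Optics
Aerospace engineering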
Authors
Qingsong Zeng,Linxuan Zhang,Wei Wang,Xiaolong Luo,Yannan Chen
Identifier
DOI:10.1117/1.jei.33.4.043038
Abstract
Understanding perimeter objects and environmental changes in railway scenes is crucial for ensuring the safety of train operation. Semantic segmentation is the basis of intelligent perception and scene understanding, but railway scene categories are complex and effective features are challenging to extract. This work proposes DeepLab-Rail, a semantic segmentation network based on the classic yet effective encoder-decoder structure. It contains a lightweight feature extraction backbone embedded with a channel attention (CA) mechanism to keep computational complexity low. To enrich the receptive fields of the convolutional modules, we design a parallel and cascade convolution module called compound-atrous spatial pyramid pooling, and a combination of dilated convolution rates is selected through experiments to obtain multi-scale features. To make full use of both the shallow features and the high-level features, an efficient CA mechanism is introduced, and a mixed loss function is designed to address the unbalanced label categories of the dataset. Finally, experimental results on the RailSem19 railway dataset show that the mean intersection over union (mIoU) reaches 65.52% and the pixel accuracy (PA) reaches 88.48%. The segmentation performance on easily confused railway facilities, such as signal lights and catenary pillars, is significantly improved and, to the best of our knowledge, surpasses other advanced methods.
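The two metrics reported in the abstract, mean intersection over union and pixel accuracy, can be illustrated with a minimal NumPy sketch. It uses the standard confusion-matrix definitions and is not the authors' evaluation code; the function names and the choice to skip classes absent from both prediction and ground truth are assumptions made for illustration.

```python
import numpy as np

def confusion_matrix(y_true, y_pred, num_classes):
    """Count pixels per (true class, predicted class) pair.

    Assumes predictions lie in [0, num_classes); ground-truth labels
    outside that range (e.g. an ignore label) are dropped.
    """
    y_true = np.asarray(y_true).ravel()
    y_pred = np.asarray(y_pred).ravel()
    valid = (y_true >= 0) & (y_true < num_classes)
    idx = num_classes * y_true[valid] + y_pred[valid]
    counts = np.bincount(idx, minlength=num_classes ** 2)
    return counts.reshape(num_classes, num_classes)

def pixel_accuracy(cm):
    """Fraction of pixels whose predicted class matches the label."""
    return np.trace(cm) / cm.sum()

def mean_iou(cm):
    """Average of per-class IoU = TP / (TP + FP + FN).

    Classes that appear in neither labels nor predictions are skipped
    (an assumption; other conventions exist).
    """
    tp = np.diag(cm)
    union = cm.sum(axis=0) + cm.sum(axis=1) - tp
    present = union > 0
    return (tp[present] / union[present]).mean()

# Toy 2x2 label map with two classes (hypothetical data).
labels = np.array([[0, 0], [1, 1]])
preds = np.array([[0, 1], [1, 1]])
cm = confusion_matrix(labels, preds, num_classes=2)
print(pixel_accuracy(cm), mean_iou(cm))
```

In the toy example, three of the four pixels are correct, and the per-class IoU values are 1/2 and 2/3, so the two metrics differ even on the same prediction. This is why both are usually reported for datasets with unbalanced label categories.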