计算机科学
人工智能
分割
RGB颜色模型
特征提取
特征(语言学)
杠杆(统计)
深度学习
图像分割
语义学(计算机科学)
模式识别(心理学)
计算机视觉
语义特征
语言学
哲学
程序设计语言
作者
Wujie Zhou,Jinfu Liu,Jingsheng Lei,Lu Yu,Jenq‐Neng Hwang
出处
期刊:IEEE transactions on image processing
[Institute of Electrical and Electronics Engineers]
日期:2021-01-01
卷期号:30: 7790-7802
被引量:200
标识
DOI:10.1109/tip.2021.3109518
摘要
Semantic segmentation is a fundamental task in computer vision, and it has various applications in fields such as robotic sensing, video surveillance, and autonomous driving. A major research topic in urban road semantic segmentation is the proper integration and use of cross-modal information for fusion. Here, we attempt to leverage inherent multimodal information and acquire graded features to develop a novel multilabel-learning network for RGB-thermal urban scene semantic segmentation. Specifically, we propose a strategy for graded-feature extraction to split multilevel features into junior, intermediate, and senior levels. Then, we integrate RGB and thermal modalities with two distinct fusion modules, namely a shallow feature fusion module and deep feature fusion module for junior and senior features. Finally, we use multilabel supervision to optimize the network in terms of semantic, binary, and boundary characteristics. Experimental results confirm that the proposed architecture, the graded-feature multilabel-learning network, outperforms state-of-the-art methods for urban scene semantic segmentation, and it can be generalized to depth data.
科研通智能强力驱动
Strongly Powered by AbleSci AI