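Computer science
Segmentation
Feature (linguistics)
Encoder
Artificial intelligence
RGB color model
Pattern recognition (psychology)
Layer (electronics)
Image segmentation
Computer vision
Philosophy
Linguistics
Operating system
Chemistry
Organic chemistry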
Authors
Enquan Yang,Wujie Zhou,Xionghong Qian,Lu Yu
Source
Journal: IEEE Signal Processing Letters
[Institute of Electrical and Electronics Engineers]
Date: 2022-01-01
Volume: 29, pages 2567-2571
Citations: 12
Identifier
DOI:10.1109/lsp.2022.3229594
Abstract
RGB-D semantic segmentation of indoor scenes is a long-standing research topic. However, because of the intrinsic differences in modal information and the large gaps between multi-level feature cues, the traditional U-Net framework yields suboptimal indoor scene segmentation. In this paper, we consider an effective feature exploration approach to achieve accurate segmentation. Specifically, it consists of three steps. First, in the encoder, we design a difference-exploration fusion module, which extracts the difference weights of the two modalities to guide their fusion, so as to achieve intrinsically consistent feature fusion. The gated decoder module covers the remaining two steps. Second, we use a gating unit for each level of fusion information to reduce the difference between layers, which also sharpens the distinctive information of a specific layer while avoiding mutual exclusion between layers. Finally, we use a serial-parallel alternation strategy to increase the ability to capture contextual knowledge. Combining these three steps, we construct the multilevel gated collaborative network (MGCNet). Extensive experiments indicate that the proposed MGCNet competes favorably with state-of-the-art models under three standard metrics.
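The abstract gives no formulas, so the following is only a rough sketch of what difference-guided fusion and per-level gating could look like, not the authors' implementation. The function names (`difference_guided_fusion`, `gate_level`), the channel-wise sigmoid weighting, and the use of plain NumPy arrays in place of learned convolutional layers are all assumptions made for illustration.

```python
import numpy as np


def sigmoid(x):
    """Elementwise logistic function."""
    return 1.0 / (1.0 + np.exp(-x))


def difference_guided_fusion(rgb_feat, depth_feat):
    """Hypothetical fusion of two (C, H, W) feature maps.

    Assumption: a channel-wise weight is derived from the mean difference
    between the two modalities and used to blend them. The paper's actual
    module is learned and is not specified in the abstract.
    """
    diff = rgb_feat - depth_feat
    # One weight per channel, in (0, 1); equals 0.5 when the modalities agree.
    w = sigmoid(diff.mean(axis=(1, 2), keepdims=True))  # shape (C, 1, 1)
    # Convex combination of the two modalities, broadcast over H and W.
    return w * rgb_feat + (1.0 - w) * depth_feat


def gate_level(fused_feat, gate_logits):
    """Hypothetical gating unit: scale a level's features by a sigmoid gate.

    Assumption: gate_logits would come from a learned layer; here they are
    passed in directly.
    """
    return sigmoid(gate_logits) * fused_feat


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    rgb = rng.normal(size=(4, 8, 8))    # stand-in for RGB encoder features
    depth = rng.normal(size=(4, 8, 8))  # stand-in for depth encoder features
    fused = difference_guided_fusion(rgb, depth)
    gated = gate_level(fused, np.zeros_like(fused))
    print(fused.shape, gated.shape)
```

In this sketch the fused map is always an elementwise blend lying between the two inputs, and the gate can only attenuate a level's features. A real model would replace both the weights and the gate logits with trainable layers.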