计算机科学
人工智能
分割
背景(考古学)
特征(语言学)
RGB颜色模型
计算机视觉
卷积神经网络
图像分割
骨干网
模式识别(心理学)
计算机网络
语言学
生物
哲学
古生物学
标识
DOI:10.1109/tpami.2019.2923513
摘要
Semantic segmentation of images requires an understanding of appearances of objects and their spatial relationships in scenes. The fully convolutional network (FCN) has been successfully applied to recognize objects' appearances, which are represented with RGB channels. Images augmented with depth channels provide more understanding of the geometric information of the scene in an image. In this paper, we present a multiple-branch neural network to utilize depth information to assist in the semantic segmentation of images. Our approach splits the image into layers according to the “scene-scale”. We introduce the context-aware receptive field (CARF), which provides better control of the relevant context information of learned features. Each branch of the network is equipped with CARF to adaptively aggregate the context information of image regions, leading to a more focused domain that is easier to learn. Furthermore, we propose a new zig-zag architecture to exchange information between the feature maps at different levels, augmented by the CARFs of the backbone network and decoder network. With the flexible information propagation allowed by our zig-zag network, we enrich the context information of feature maps for the segmentation. We show that the zig-zag network achieves state-of-the-art performances on several public datasets.
科研通智能强力驱动
Strongly Powered by AbleSci AI