计算机科学
增采样
分割
人工智能
联营
特征(语言学)
模式识别(心理学)
背景(考古学)
棱锥(几何)
水准点(测量)
机器学习
图像(数学)
光学
物理
哲学
古生物学
生物
地理
语言学
大地测量学
作者
Wujie Zhou,Jin Jianhui,Jingsheng Lei,Lu Yu
出处
期刊:IEEE Journal of Selected Topics in Signal Processing
[Institute of Electrical and Electronics Engineers]
日期:2022-03-15
卷期号:16 (4): 666-676
被引量:48
标识
DOI:10.1109/jstsp.2022.3159032
摘要
Semantic segmentation of remote sensing images has received increasing attention in recent years; however, using a single imaging modality limits the segmentation performance. Thus, digital surface models have been integrated into semantic segmentation to improve performance. Nevertheless, existing methods based on neural networks simply combine data from the two modalities, mostly neglecting the similarities and differences between multimodal features. Consequently, the complementarity between multimodal features cannot be exploited, and excess noise is introduced during feature processing. To solve these problems, we propose a multimodal fusion module to explore the similarities and differences between features from the two information modalities for adequate fusion. In addition, although downsampling operations such as pooling and striding can improve the feature representativeness, they discard spatial details and often lead to segmentation errors. Thus, we introduce hierarchical feature interactions to mitigate the adverse effects of downsampling and introduce a two-way interactive pyramid pooling module to extract multiscale context features for guiding feature fusion. Extensive experiments performed on two benchmark datasets show that the proposed network integrating our novel modules substantially outperforms state-of-the-art semantic segmentation methods. The code and results can be found at https://github.com/NIT-JJH/CIMFNet .
科研通智能强力驱动
Strongly Powered by AbleSci AI