DF2 RQ: Dynamic Feature Fusion via Region-wise Queries for Semantic Segmentation of Multimodal Remote Sensing Data

计算机科学模式特征（语言学）人工智能模态（人机交互）模式识别（心理学）传感器融合判别式融合语义学（计算机科学）分割语言学哲学社会科学社会学程序设计语言

作者

Shiyang Feng,Zhaowei Li,Bo Zhang,Bin Wang

出处

期刊：IEEE Transactions on Geoscience and Remote Sensing [Institute of Electrical and Electronics Engineers]
日期：2025-01-01 卷期号：: 1-1

标识

DOI：10.1109/tgrs.2025.3526247

摘要

Although remote sensing (RS) data with multiple modalities can be used to significantly improve the accuracy of semantic segmentation in RS data, how to effectively extract multimodal information through multimodal feature fusion remains a challenging task. Specifically, existing methods for multimodal feature fusion still face two major challenges: 1) Due to the diverse imaging mechanisms of multimodal RS data, the boundaries of the same foreground may vary across different modalities, leading to the inclusion of unwanted background semantics in the fused foreground features; 2) RS data from different modalities exhibit varying discriminative abilities for different foregrounds, making it challenging to determine the proportion of semantic information for each modality in the fusion results. To address the above issues, we propose a dynamic feature fusion method based on region-wise queries, namely DF ² RQ, for SS of multimodal RS data. This method is primarily composed of two components: the spatial reconstruction (SR) module and the dynamic fusion (DF) module. Within the SR module, we propose a spatial reconstruction scheme that samples foreground features from different modalities, achieving independent reconstruction of different unimodal features, thereby alleviating the semantic mixing between foreground and background across modalities. In the DF module, a feature fusion scheme based on unimodal feature reference positions is proposed to obtain fusion weights for each modality, thereby enabling the dynamic fusion of complementary features from multiple modalities. The performance of the proposed method has been extensively evaluated on various multimodal RS datasets for SS, and the experimental results consistently show that the proposed method achieves state-of-the-art accuracy on multiple commonly used metrics. In addition, our code is available at https://github.com/I3ab/DF2RQ.

求助该文献

最长约 10秒，即可获得该文献文件

DF2 RQ: Dynamic Feature Fusion via Region-wise Queries for Semantic Segmentation of Multimodal Remote Sensing Data

今日热心研友