小波
分割
融合
计算机科学
领域(数学分析)
人工智能
频域
遥感
地理
模式识别(心理学)
计算机视觉
数学
数学分析
哲学
语言学
作者
Yue-Tzu Yang,Genji Yuan,Jinjiang Li
出处
期刊:Cornell University - arXiv
日期:2024-05-03
标识
DOI:10.48550/arxiv.2405.01992
摘要
In order to fully utilize spatial information for segmentation and address the challenge of handling areas with significant grayscale variations in remote sensing segmentation, we propose the SFFNet (Spatial and Frequency Domain Fusion Network) framework. This framework employs a two-stage network design: the first stage extracts features using spatial methods to obtain features with sufficient spatial details and semantic information; the second stage maps these features in both spatial and frequency domains. In the frequency domain mapping, we introduce the Wavelet Transform Feature Decomposer (WTFD) structure, which decomposes features into low-frequency and high-frequency components using the Haar wavelet transform and integrates them with spatial features. To bridge the semantic gap between frequency and spatial features, and facilitate significant feature selection to promote the combination of features from different representation domains, we design the Multiscale Dual-Representation Alignment Filter (MDAF). This structure utilizes multiscale convolutions and dual-cross attentions. Comprehensive experimental results demonstrate that, compared to existing methods, SFFNet achieves superior performance in terms of mIoU, reaching 84.80% and 87.73% respectively.The code is located at https://github.com/yysdck/SFFNet.
科研通智能强力驱动
Strongly Powered by AbleSci AI