计算机科学
分割
地点
编码器
人工智能
特征提取
整体性
变压器
内存占用
模式识别(心理学)
数据挖掘
计算机视觉
工程类
哲学
语言学
电压
经济
电气工程
市场经济
全球化
操作系统
作者
Lele Xu,Ye Li,Jinzhong Xu,Yue Zhang,Lili Guo
出处
期刊:IEEE Transactions on Geoscience and Remote Sensing
[Institute of Electrical and Electronics Engineers]
日期:2023-01-01
卷期号:61: 1-14
被引量:18
标识
DOI:10.1109/tgrs.2023.3262967
摘要
Building footprint extraction in remote sensing remains challenging due to the diverse appearances of buildings and confusing scenarios. Recently, researchers have revealed that both the globality and locality are vitally important in building footprint extraction tasks and proposed to incorporate the local context and global long-range dependency in the segmentation models. However, the inadequate integration of the globality and locality still leads to incomplete, fake or missing extraction results. To alleviate these problems, a novel segmentation method named Bi-branch Cross-fusion Transformer Network (BCTNet) is proposed in this study. Two parallel branches of the convolutional encoder branch (CB) and the transformer encoder branch (TB) are designed to extract multi-scale feature maps. A concatenation-then-cross-fusion transformer block (CCTB) is put forward to integrate the locality from the CB and globality from the TB in a cross-fusion way at each stage of the encoding process. Then, an adaptive gating module (AGM) is proposed to gate the feature maps from the CCTB to strengthen the important features while suppressing the irrelevant interference information. After that, the segmentation results can be obtained through a simple decoding process. Comprehensive experiments on two benchmark datasets demonstrate that the proposed BCTNet can achieve superior performance compared to the current state-of-the-art (SOTA) segmentation methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI