遥感
计算机科学
高分辨率
变压器
图像分辨率
人工智能
计算机视觉
地质学
工程类
电压
电气工程
作者
Lipeng Gao,Yiqing Zhou,Jiangtao Tian,Wenjing Cai
出处
期刊:IEEE Transactions on Geoscience and Remote Sensing
[Institute of Electrical and Electronics Engineers]
日期:2024-01-01
卷期号:62: 1-19
标识
DOI:10.1109/tgrs.2024.3404044
摘要
Influenced by the concepts of deep learning, extracting roads from high-resolution remote sensing scenes has gained significant attention. However, there are still limitations in both metrics and practical application scenarios. To address these limitations, we proposed a deformable and dynamic cross-transformer network (DDCTNet), introducing three key innovations. Firstly, we employed a deformable and dynamic cross-transformer (DDCT) attention module to enhance the recovery of data and structural information during the feature map upsampling by providing rich semantic information of encoding stage to decoding stage from spatial and channel dimensions, respectively, which improved the quality of upsampling while preserving the inherent characteristics of the road. Secondly, we introduced a cross-scale strip-pooling axial attention (CSSA) between discontinuous encoding stages to alleviate the information loss caused by down-sampling and highlight the linear characteristic of roads by leveraging rich semantic information from previous stage, which not only considers road linear features in complex scenes but also reduces computational complexity. Finally, we designed an auxiliary head (AuxHead) by fusing the outputs from the latter three decoding modules to enhance the model's generalization performance and convergence speed. Extensive experiments were conducted on three benchmark datasets. We also compared our DDCTNet with other classic road extraction models. The results show a noticeable improvement of 1%-5% across various evaluation metrics in three datasets. Additionally, the visualized results demonstrate that the proposed DDCTNet provides more accurate representations of real road scenes including distinguishing regions with high foreground-background similarity, addressing road occlusion, etc.
科研通智能强力驱动
Strongly Powered by AbleSci AI