计算机科学
背景(考古学)
变更检测
人工智能
特征(语言学)
空间语境意识
空间分析
模式识别(心理学)
遥感
地理
语言学
哲学
考古
作者
Wen Xiao,Hui Cao,Yuqi Lei,Qiqi Zhu,Nengcheng Chen
出处
期刊:International journal of applied earth observation and geoinformation
日期:2024-08-01
卷期号:132: 104075-104075
标识
DOI:10.1016/j.jag.2024.104075
摘要
Accurate detection of changes in buildings is crucial for the understanding of urban development. The growing accessibility of remote sensing imagery has enabled urban scale change detection (CD) in both 2D and 3D. However, existing methods have not yet fully exploited the fusion of feature information in multi-temporal images, resulting in insufficient accuracy in 2D changed regions or in elevation changes. To this end, a Cross-temporal and Spatial Context Learning Network (CSCLNet) aimed at multi-task building CD from dual-temporal optical images is proposed, capturing both 2D and 3D changes simultaneously. It leverages a CNN network to extract multi-layer semantic features. Subsequently, two modules, Cross-temporal Transformer Semantic Enhancement (CTSE) and Multi-layer Feature Fusion (MFF), are developed to refine the feature representations. CTSE enhances temporal information by cross attention of dual-temporal features to enable interactions and MFF fuses multi-layer features and enhances attention to global and local spatial context. Finally, two prediction heads are introduced to separately handle 2D and 3D change prediction, identifying changed building objects and their elevation changes. Experiments conducted with two public datasets, 3DCD and SMARS, show that the CSCLNet achieves state-of-the-art for both 2D and 3D CD tasks. In particular, the change-specific RMSE of elevation changes has been reduced to 4.52 m in real world scenes. The code is available at: https://github.com/Geo3DSmart/CSCLNet.
科研通智能强力驱动
Strongly Powered by AbleSci AI