计算机科学
变更检测
人工智能
计算机视觉
棱锥(几何)
遥感
图像分辨率
变压器
高分辨率
模式识别(心理学)
地质学
工程类
光学
物理
电气工程
电压
作者
Pengyuan Lv,Mengchen Li,Yanfei Zhong
出处
期刊:IEEE Geoscience and Remote Sensing Letters
[Institute of Electrical and Electronics Engineers]
日期:2024-01-01
卷期号:21: 1-5
被引量:1
标识
DOI:10.1109/lgrs.2024.3404645
摘要
The vision transformer (ViT) model has the advantage of being able to model the long-range dependencies in the imagery and has been studied for the task of remote sensing image change detection (CD). However, the performance of the existing transformer-based CD methods is not satisfactory in the case of limited labeled data. The original self-attention mechanism cannot effectively extract the change information, and the large number of parameters in the ViT model makes the model difficult to train. To solve the above-mentioned problems, a semi-supervised pyramid cross-temporal attention transformer for change detection (CT 2 RCDSS) is proposed in this letter. The CT 2 RCDSS method follows an encoder-decoder structure. The encoder utilizes a dual-branch structure, containing the combination of the proposed cross-temporal attention (PCTA) and pyramid self-attention (PSA) mechanisms, which is designed to consider the interaction of the features from different time phases and enhance the changes at different scales. In the decoder, a series of deconvolutional layers with skip connections are utilized, and a Softmax layer follows to acquire the final binary change map. In addition, a semi-supervised training strategy, which reduces the errors in the pseudo-labels generated from the models initialized with different parameters, is used to improve the model stability while using unlabeled data. The experiments showed that the proposed method can achieve a superior F1-score and intersection over union (IoU), which indicates the potential of the proposed method.
科研通智能强力驱动
Strongly Powered by AbleSci AI