计算机科学
人工智能
萃取(化学)
匹配(统计)
计算机视觉
图像分辨率
对象(语法)
遥感
水体
模式识别(心理学)
特征提取
图像匹配
领域(数学分析)
地质学
图像(数学)
数学
统计
数学分析
化学
岩土工程
色谱法
作者
Zhen Li,Qiqi Zhu,Jiahui Yang,Jianjun Lv,Qingfeng Guan
出处
期刊:IEEE Transactions on Geoscience and Remote Sensing
[Institute of Electrical and Electronics Engineers]
日期:2024-01-01
卷期号:62: 1-15
标识
DOI:10.1109/tgrs.2024.3407200
摘要
Large-scale information pertaining to surface water bodies is crucial for activities such as flood monitoring. Deep learning algorithms have shown great potential in water-body extraction based on high spatial resolution (HSR) imagery. However, the current reliance on deep learning for HSR imagery water-body extraction necessitates a substantial quantity of manually labeled training samples. The variance in spatial resolution among images and the intricacies of scenes consistently pose challenges to the transferability of deep learning. Moreover, the number of pixels representing water bodies is typically lower compared to the number of background pixels. This imbalance in class prediction probabilities often limits the accuracy of water-body class predictions. In this paper, we propose a cross-domain object-semantic matching (COM) framework for extracting water bodies from unlabeled high-resolution remote sensing imagery. The distinctions in spectra, shapes, and semantic distributions of water bodies across various domains create challenges for certain source domain samples to contribute positively to model training. Therefore, a sample semantic similarity matching mechanism is devised. The proposed object contextual perception network (OCPNet) models multi-scale water body features and object-contextual representations, aiming to achieve a more accurate and comprehensive representation of surface water bodies. Additionally, to prevent the training process from being dominated by easily transferred categories in the target domain, a weighted joint loss is designed to alleviate the imbalance of predicted probabilities and pixel numbers between water and non-water bodies. Experiments on four public datasets of GID, CCF, LoveDA and DeepGlobe demonstrate the effectiveness and generalization of our proposed framework.
科研通智能强力驱动
Strongly Powered by AbleSci AI