STFDiff: Remote sensing image spatiotemporal fusion with diffusion models

计算机科学 图像融合 扩散 融合 遥感 图像(数学) 计算机视觉 人工智能 地质学 物理 语言学 哲学 热力学
作者
He Huang,Wei He,Hongyan Zhang,Yu Xia,Liangpei Zhang
出处
期刊:Information Fusion [Elsevier BV]
卷期号:111: 102505-102505 被引量:42
标识
DOI:10.1016/j.inffus.2024.102505
摘要

Spatiotemporal fusion (STF) methods aim to blend satellite images with different spatial and temporal resolutions to support more frequent and precise monitoring. In the past decades, amounts of STF methods have been developed with remarkable success. However, among the existing methods, the traditional methods rely on the linear assumption and fail for complex and diverse scenes with great dynamics. The deep learning-based methods suffer from the spatial, temporal and spectral uncertainties in STF and the mode collapse problem of generative adversarial networks (GANs) for remote sensing images with complex scenes. To address these problems, we propose a novel spatiotemporal fusion method with diffusion models (STFDiff) that merges a coarse image at the prediction date and the coarse-fine image pairs acquired at other dates to generate the fine image at the prediction date. STFDiff generates the fine image via repeated refinement with initialized Gaussian noise under the control of the prior images acquired at other dates. At each iteration, the noise is predicted through a conditional noise predictor dual-stream Unet (DS-Unet), which enhances the noise features by subtracting the extracted features from the dual-stream encoders (DS-encoders). The noise is then gradually removed, and finally the fine image is generated with similar spatial details to the fine images and temporal dynamics to the coarse images. Comprehensive experiments on two public datasets and one personally collected dataset demonstrate that STFDiff outperforms state-of-the-art (SOTA) methods. To further verify the applicability of STFDiff on downstream tasks, we compared the K-means clustering results on the fusion images generated by different methods. The results show that the classification results of STFDiff are the most consistent with the actual images and obtain ∼2% mean intersection over union (mIoU) improvement over the SOTA methods. The source code is available at https://github.com/prowDIY/STF.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
王乐乐哈发布了新的文献求助10
1秒前
科研通AI2S应助lww采纳,获得50
1秒前
3秒前
牛牛发布了新的文献求助10
4秒前
4秒前
Hello应助XX采纳,获得10
6秒前
zbidnh完成签到,获得积分10
6秒前
潘先森发布了新的文献求助10
6秒前
传奇3应助小卢采纳,获得10
7秒前
7秒前
丨墨月丨发布了新的文献求助10
7秒前
8秒前
8秒前
顾矜应助怕黑若云采纳,获得10
8秒前
柠檬发布了新的文献求助10
8秒前
9秒前
小栩发布了新的文献求助10
9秒前
9秒前
完蛋完成签到,获得积分20
10秒前
11秒前
zzz发布了新的文献求助10
11秒前
alexlpb发布了新的文献求助10
11秒前
13秒前
godblessyou发布了新的文献求助10
14秒前
14秒前
14秒前
14秒前
柔弱机器猫完成签到,获得积分10
14秒前
宇宙星河完成签到,获得积分10
16秒前
wang发布了新的文献求助10
16秒前
16秒前
16秒前
YY发布了新的文献求助10
17秒前
zhengbiaoying完成签到,获得积分20
17秒前
Evilw1an发布了新的文献求助10
17秒前
17秒前
科目三应助善良的以南采纳,获得10
18秒前
molihuakai应助godblessyou采纳,获得10
19秒前
贤惠的冷亦完成签到,获得积分10
19秒前
随机发发布了新的文献求助10
19秒前
高分求助中
Overcoming Stigma and Bias in Obesity Management 1200
Signals, Systems, and Signal Processing 610
Software that combines deep learning,3D reconstruction and CFD to analyze the state of carotid arteries from ultrasound imaging 500
Bounds for Statistical Estimation in Semiparametric Models 500
Forced degradation and stability indicating LC method for Letrozole: A stress testing guide 500
Ideology and Meaning-Making under the Putin Regime 450
Adhesion Science: Principles & Practice 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6493201
求助须知:如何正确求助?哪些是违规求助? 8290657
关于积分的说明 17691570
捐赠科研通 5585361
什么是DOI,文献DOI怎么找? 2915586
邀请新用户注册赠送积分活动 1892651
关于科研通互助平台的介绍 1751038