计算机科学
推论
编码器
采样(信号处理)
降噪
人工智能
源代码
编码(集合论)
扩散
数据挖掘
计算机视觉
实时计算
操作系统
物理
滤波器(信号处理)
热力学
集合(抽象数据类型)
程序设计语言
作者
Tai An,Bin Xue,Chunlei Huo,Shiming Xiang,Chunhong Pan
出处
期刊:IEEE Geoscience and Remote Sensing Letters
[Institute of Electrical and Electronics Engineers]
日期:2023-11-21
卷期号:21: 1-5
被引量:5
标识
DOI:10.1109/lgrs.2023.3335421
摘要
With the emergence of diffusion models, the image generation has experienced a significant advancement. In super-resolution tasks, diffusion models surpass generative adversarial network (GAN)-based methods in generating more realistic samples. However, these models come with significant costs: denoising networks rely on large U-Net, making them computationally intensive for high-resolution (HR) images, and the extensive sampling steps in diffusion models lead to prolonged inference time. This complexity limits their application in remote sensing, due to the high demand for high-resolution images in such scenarios. To address this, we propose a lightweight diffusion model (LWTDM), which simplifies the denoising network and efficiently incorporates conditional information using a cross-attention-based encoder–decoder architecture. Furthermore, LWTDM serves as the pioneering model that incorporates the accelerated sampling technique from denoising diffusion implicit models (DDIMs). This integration involves the meticulous selection of sampling steps, ensuring the quality of the generated images. The experiments confirm that LWTDM strikes a favorable balance between precision and perceptual quality, while its faster inference speed makes it suitable for diverse remote sensing scenarios with specific requirements. The source code is available at: https://github.com/Suanmd/LWTDM .
科研通智能强力驱动
Strongly Powered by AbleSci AI