修补
快速傅里叶变换
计算机科学
人工智能
卷积神经网络
扩散
图像(数学)
计算机视觉
雷达FFT算法
模式识别(心理学)
算法
傅里叶变换
傅里叶分析
数学
分数阶傅立叶变换
数学分析
物理
热力学
作者
Yuxuan Hu,Hanting Wang,Cong Jin,Bo Li,Chunwei Tian
摘要
Diffusion models for image inpainting have been the subject of growing research interest in recent years. However, generating content that is consistent with the original images, especially for complex images with intricate details and structural information, remains a significant challenge. In this paper, we propose a diffusion model with an FFT (FFT-DM) to generate content that matches missing region texture and semantics to inpaint damaged images. Specifically, FFT-DM contains two components: a Denoising Diffusion Probabilistic Model (DDPM) and a Convolutional Neural Network (CNN). The DDPM is used to extract global features and generate image prior while the CNN captures more fine-grained details and predicts the parameters in the reverse process of the diffusion model. Notably, we integrate a Fast Fourier Transform (FFT) into the diffusion model to enhance the perception ability and improve the efficiency of the model. Extensive experiments demonstrate that FFT-DM outperforms current state-of-the-art inpainting approaches in terms of qualitative and quantitative analysis.
科研通智能强力驱动
Strongly Powered by AbleSci AI