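Computer science
Probabilistic logic
Smoothing
Image (mathematics)
Interpolation (computer graphics)
Artificial intelligence
Residual
Gaussian distribution
Noise (video)
Mixture model
Pattern recognition (psychology)
Algorithm
Image resolution
Computer vision
Quantum mechanics
Physics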
Authors
Haoying Li,Yifan Yang,Meng Chang,Shiqi Chen,Huajun Feng,Zhihai Xu,Qi Li,Yueting Chen
Identifier
DOI:10.1016/j.neucom.2022.01.029
Abstract
Single image super-resolution (SISR) aims to reconstruct high-resolution (HR) images from given low-resolution (LR) images. It is an ill-posed problem because one LR image corresponds to multiple HR images. Recently, learning-based SISR methods have greatly outperformed traditional methods. However, PSNR-oriented, GAN-driven and flow-based methods suffer from over-smoothing, mode collapse and large model footprints, respectively. To solve these problems, we propose a novel SISR diffusion probabilistic model (SRDiff), which is the first diffusion-based model for SISR. SRDiff is optimized with a variant of the variational bound on the data likelihood. Through a Markov chain, it provides diverse and realistic super-resolution (SR) predictions by gradually transforming Gaussian noise into an SR image conditioned on an LR input. In addition, we introduce residual prediction to the whole framework to speed up model convergence. Our extensive experiments on facial and general benchmarks (the CelebA and DIV2K datasets) show that (1) SRDiff can generate diverse SR results with rich details and achieve competitive performance against other state-of-the-art methods when given only one LR input; (2) SRDiff is easy to train and has a small footprint, where "footprint" in this paper means model size (the number of model parameters); (3) SRDiff can perform flexible image manipulation operations, including latent space interpolation and content fusion.
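The sampling procedure described in the abstract (a Markov chain that gradually turns Gaussian noise into an SR image conditioned on the LR input, combined with residual prediction) can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation. It assumes the standard DDPM ancestral sampling update with a linear noise schedule, nearest-neighbour upsampling of the LR image, and one plausible reading of "residual prediction" in which the chain generates the difference between the HR image and the upsampled LR image. The names `upsample_nearest`, `make_schedule`, `dummy_noise_predictor` and `sample_sr` are hypothetical, and the noise predictor is a placeholder for a trained conditional network.

```python
import numpy as np


def upsample_nearest(lr, scale):
    """Nearest-neighbour upsampling of a 2-D image (assumed; any upsampler could be used)."""
    return np.repeat(np.repeat(lr, scale, axis=0), scale, axis=1)


def make_schedule(num_steps, beta_start=1e-4, beta_end=0.02):
    """Linear beta schedule as in standard DDPM (values are assumptions, not from the abstract)."""
    betas = np.linspace(beta_start, beta_end, num_steps)
    alphas = 1.0 - betas
    alpha_bars = np.cumprod(alphas)
    return betas, alphas, alpha_bars


def dummy_noise_predictor(x_t, lr_up, t):
    """Hypothetical stand-in for a trained network eps_theta(x_t, LR, t); always predicts zero noise."""
    return np.zeros_like(x_t)


def sample_sr(lr, scale, num_steps, noise_predictor, rng):
    """Reverse Markov chain: start from Gaussian noise, denoise step by step,
    then add the resulting residual to the upsampled LR image."""
    betas, alphas, alpha_bars = make_schedule(num_steps)
    lr_up = upsample_nearest(lr, scale)

    # x_T ~ N(0, I): the chain starts from pure Gaussian noise.
    x = rng.standard_normal(lr_up.shape)

    for t in range(num_steps - 1, -1, -1):
        eps = noise_predictor(x, lr_up, t)
        # Posterior mean of the standard DDPM reverse step.
        mean = (x - betas[t] / np.sqrt(1.0 - alpha_bars[t]) * eps) / np.sqrt(alphas[t])
        if t > 0:
            # Fresh noise at every step except the last gives diverse outputs.
            x = mean + np.sqrt(betas[t]) * rng.standard_normal(x.shape)
        else:
            x = mean

    # Residual prediction (assumed form): SR = upsampled LR + generated residual.
    return lr_up + x


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    lr_image = rng.random((4, 4))
    sr_image = sample_sr(lr_image, scale=2, num_steps=10,
                         noise_predictor=dummy_noise_predictor, rng=rng)
    print(sr_image.shape)
```

Because fresh noise is drawn at each step, calling `sample_sr` with different random generators yields different SR candidates for the same LR input, which is the mechanism behind the "diverse" predictions the abstract mentions. With the placeholder predictor the outputs are not meaningful images; a trained network would have to be substituted.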