计算机科学
人工智能
图像(数学)
降噪
图像去噪
噪音(视频)
模式识别(心理学)
概率逻辑
参数化复杂度
翻译(生物学)
比例(比率)
分辨率(逻辑)
任务(项目管理)
机器学习
计算机视觉
算法
工程类
物理
信使核糖核酸
基因
化学
系统工程
量子力学
生物化学
作者
Hshmat Sahak,Daniel Watson,Chitwan Saharia,David J. Fleet
出处
期刊:Cornell University - arXiv
日期:2023-02-15
标识
DOI:10.48550/arxiv.2302.07864
摘要
Diffusion models have shown promising results on single-image super-resolution and other image- to-image translation tasks. Despite this success, they have not outperformed state-of-the-art GAN models on the more challenging blind super-resolution task, where the input images are out of distribution, with unknown degradations. This paper introduces SR3+, a diffusion-based model for blind super-resolution, establishing a new state-of-the-art. To this end, we advocate self-supervised training with a combination of composite, parameterized degradations for self-supervised training, and noise-conditioing augmentation during training and testing. With these innovations, a large-scale convolutional architecture, and large-scale datasets, SR3+ greatly outperforms SR3. It outperforms Real-ESRGAN when trained on the same data, with a DRealSR FID score of 36.82 vs. 37.22, which further improves to FID of 32.37 with larger models, and further still with larger training sets.
科研通智能强力驱动
Strongly Powered by AbleSci AI