计算机科学
双三次插值
卷积神经网络
人工智能
管道(软件)
特征(语言学)
卷积(计算机科学)
计算复杂性理论
插值(计算机图形学)
像素
计算机视觉
图像分辨率
深度学习
增采样
滤波器(信号处理)
模式识别(心理学)
人工神经网络
图像(数学)
算法
线性插值
哲学
语言学
程序设计语言
作者
Wenzhe Shi,José Caballero,Ferenc Huszár,Johannes Totz,Andrew P. Aitken,Rob Bishop,Daniel Rueckert,Zehan Wang
出处
期刊:Computer Vision and Pattern Recognition
日期:2016-06-01
卷期号:: 1874-1883
被引量:5530
标识
DOI:10.1109/cvpr.2016.207
摘要
Recently, several models based on deep neural networks have achieved great success in terms of both reconstruction accuracy and computational performance for single image super-resolution. In these methods, the low resolution (LR) input image is upscaled to the high resolution (HR) space using a single filter, commonly bicubic interpolation, before reconstruction. This means that the super-resolution (SR) operation is performed in HR space. We demonstrate that this is sub-optimal and adds computational complexity. In this paper, we present the first convolutional neural network (CNN) capable of real-time SR of 1080p videos on a single K2 GPU. To achieve this, we propose a novel CNN architecture where the feature maps are extracted in the LR space. In addition, we introduce an efficient sub-pixel convolution layer which learns an array of upscaling filters to upscale the final LR feature maps into the HR output. By doing so, we effectively replace the handcrafted bicubic filter in the SR pipeline with more complex upscaling filters specifically trained for each feature map, whilst also reducing the computational complexity of the overall SR operation. We evaluate the proposed approach using images and videos from publicly available datasets and show that it performs significantly better (+0.15dB on Images and +0.39dB on Videos) and is an order of magnitude faster than previous CNN-based methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI