计算机科学
编解码器
增采样
人工智能
数据压缩
采样(信号处理)
计算机视觉
视频压缩图片类型
多视点视频编码
抽取
卷积神经网络
视频处理
视频跟踪
电信
滤波器(信号处理)
图像(数学)
作者
Yuzhuo Wei,Li Chen,Li Song
标识
DOI:10.1109/vcip53242.2021.9675356
摘要
With the blooming of deep learning technology in computer vision, the integration of deep learning and the traditional video coding has made significant improvements, especially applying the super-resolution neural network as the post-processing module in the down-sampling-based video compression framework. However, the pre-processing module lacks back-propagated gradients for jointly considering down-sampling and up-sampling due to the non-differentiability of the traditional video codec. In this paper, we propose an end- to-end down-sampling-based video compression framework applying convolutional neural networks both as down-sampling and upsampling. We use a virtual codec neural network to approximate the actual video codec so that the gradient can be effectively back-propagated for joint training. Experimental results show the superiority of our proposed framework compared with the predefined down-sampling-based video compression and various methods of joint training.
科研通智能强力驱动
Strongly Powered by AbleSci AI