Partition Map Prediction for Fast Block Partitioning in VVC Intra-Frame Coding

四叉树分拆（数论）计算机科学编码（社会科学）划分问题树形结构算法块（置换群论）卷积神经网络算法效率二叉树数学人工智能几何学统计组合数学

作者

Aolin Feng,Kang Liu,Dong Liu,Li Li,Feng Wu

出处

期刊：IEEE transactions on image processing [Institute of Electrical and Electronics Engineers]
日期：2023-01-01 卷期号：32: 2237-2251 被引量：35

链接

nih.govdoi.org

标识

DOI：10.1109/tip.2023.3266165

摘要

The Versatile Video Coding (VVC) standard introduces a block partitioning structure known as quadtree plus nested multi-type tree (QTMTT), which allows more flexible block partitioning compared to its predecessors, like High Efficiency Video Coding (HEVC). Meanwhile, the partition search (PS) process, which is to find out the best partitioning structure for optimizing the rate-distortion cost, becomes far more complicated for VVC than for HEVC. Also, the PS process in VVC reference software (VTM) is not friendly to hardware implementation. We propose a partition map prediction method for fast block partitioning in VVC intra-frame encoding. The proposed method may replace PS totally or be combined with PS partially, thereby achieving adjustable acceleration of the VTM intra-frame encoding. Different from the previous methods for fast block partitioning, we propose to represent a QTMTT-based block partitioning structure by a partition map, which consists of a quadtree (QT) depth map, several multi-type tree (MTT) depth maps, and several MTT direction maps. We then propose to predict the optimal partition map from the pixels through a convolutional neural network (CNN). We propose a CNN structure, known as Down-Up-CNN, for the partition map prediction, where the CNN structure emulates the recursive nature of the PS process. Moreover, we design a post-processing algorithm to adjust the network output partition map, so as to obtain a standard-compliant block partitioning structure. The post-processing algorithm may produce a partial partition tree as well; then based on the partial partition tree, the PS process is performed to obtain the full tree. Experimental results show that the proposed method achieves 1.61× to 8.64× encoding acceleration for the VTM-10.0 intra-frame encoder, with the ratio depending on how much PS is performed. Especially, when achieving 3.89× encoding acceleration, the compression efficiency loss is 2.77% in BD-rate, which is a better tradeoff than the previous methods.

求助该文献

最长约 10秒，即可获得该文献文件

Partition Map Prediction for Fast Block Partitioning in VVC Intra-Frame Coding

今日热心研友