计算机科学
剩余框架
参考坐标系
人工智能
计算机视觉
编码(社会科学)
多视点视频编码
算法效率
像素
视差
视图合成
帧(网络)
视频处理
渲染(计算机图形)
视频跟踪
数学
电信
统计
作者
Jing Zhang,Yonghong Hou,Zhaoqing Pan,Bo Peng,Nam Ling,Jianjun Lei
标识
DOI:10.1109/tcsvt.2023.3312213
摘要
In multiview video coding, the coding performance highly depends on the quality of the reference frames. In view of this, a step-wise reference frame generation network (SWGNet) is designed to improve the quality of the reference frame for efficient multiview video coding. In particular, a frame-level to block-level learning paradigm is proposed to step-wisely generate a high-quality reference frame. In the frame-level stage, by exploiting parallax correlations between temporal and inter-view references on the basis of image alignment, a parallax-guided frame-level synthesis module is proposed to generate an elementary reference frame. Then, in the block-level stage, a transformer-based block-level aggregation module is designed to further refine the texture details of the reference frame by modeling long-range dependencies among pixels. The proposed SWGNet is integrated into 3D-HEVC, and extensive experiments demonstrate that the proposed method achieves significant bitrate saving compared with 3D-HEVC.
科研通智能强力驱动
Strongly Powered by AbleSci AI