运动插值
计算机科学
光流
Boosting(机器学习)
人工智能
计算机视觉
插值(计算机图形学)
运动估计
帧速率
帧(网络)
运动(物理)
运动补偿
块匹配算法
视频处理
视频跟踪
图像(数学)
电信
作者
Jing Xiao,Kangmin Xu,Mengshun Hu,Liang Liao,Zheng Wang,Chia‐Wen Lin,Mi Wang,Shin’ichi Satoh
标识
DOI:10.1109/tmm.2022.3233310
摘要
Video frame interpolation has made great progress in estimating advanced optical flow and synthesizing in-between frames sequentially. However, frame interpolation involving various resolutions and motions remains challenging due to limited or fixed pre-trained networks. Inspired by the success of the coarse-to-fine scheme for video frame interpolation, i.e., gradually interpolating frames of different resolutions, we propose a progressive boosting network (ProBoost-Net) based on a multi-scale framework to achieve flexible recurrent scales and then gradually optimize optical flow estimation and frame interpolation. Specifically, we designed a dense motion boosting (DMB) module to transfer features close to real motion to the decoded features from the later scales, which provides complementary information to refine the motion further. Furthermore, to ensure the accuracy of the estimated motion features at each scale, we propose a motion adaptive fusion (MAF) module that adaptively deals with motions with different receptive fields according to the motion conditions. Thanks to the framework's flexible recurrent scales, we can customize the number of scales and make trade-offs between computation and quality depending on the application scenario. Extensive experiments with various datasets demonstrated the superiority of our proposed method over state-of-the-art approaches in various scenarios.
科研通智能强力驱动
Strongly Powered by AbleSci AI