重影
计算机科学
运动插值
帧(网络)
插值(计算机图形学)
过程(计算)
计算机视觉
约束(计算机辅助设计)
人工智能
运动模糊
感知
运动(物理)
图像(数学)
对象(语法)
视频跟踪
数学
块匹配算法
几何学
操作系统
神经科学
生物
电信
作者
Guangyang Wu,Xin Tao,Changlin Li,Wenyi Wang,Xiaohong Liu,Qingqing Zheng
出处
期刊:Cornell University - arXiv
日期:2024-04-09
标识
DOI:10.48550/arxiv.2404.06692
摘要
Previous methods for Video Frame Interpolation (VFI) have encountered challenges, notably the manifestation of blur and ghosting effects. These issues can be traced back to two pivotal factors: unavoidable motion errors and misalignment in supervision. In practice, motion estimates often prove to be error-prone, resulting in misaligned features. Furthermore, the reconstruction loss tends to bring blurry results, particularly in misaligned regions. To mitigate these challenges, we propose a new paradigm called PerVFI (Perception-oriented Video Frame Interpolation). Our approach incorporates an Asymmetric Synergistic Blending module (ASB) that utilizes features from both sides to synergistically blend intermediate features. One reference frame emphasizes primary content, while the other contributes complementary information. To impose a stringent constraint on the blending process, we introduce a self-learned sparse quasi-binary mask which effectively mitigates ghosting and blur artifacts in the output. Additionally, we employ a normalizing flow-based generator and utilize the negative log-likelihood loss to learn the conditional distribution of the output, which further facilitates the generation of clear and fine details. Experimental results validate the superiority of PerVFI, demonstrating significant improvements in perceptual quality compared to existing methods. Codes are available at \url{https://github.com/mulns/PerVFI}
科研通智能强力驱动
Strongly Powered by AbleSci AI