计算机科学
人工智能
计算机视觉
运动(物理)
噪音(视频)
帧(网络)
卷积神经网络
振铃人工制品
响铃
放大倍数
滤波器(信号处理)
图像(数学)
电信
作者
Tae-Hyun Oh,Ronnachai Jaroensri,Chang-Il Kim,Mohamed Elgharib,Frédo Durand,William T. Freeman,Wojciech Matusik
标识
DOI:10.1007/978-3-030-01225-0_39
摘要
Video motion magnification techniques allow us to see small motions previously invisible to the naked eyes, such as those of vibrating airplane wings, or swaying buildings under the influence of the wind. Because the motion is small, the magnification results are prone to noise or excessive blurring. The state of the art relies on hand-designed filters to extract representations that may not be optimal. In this paper, we seek to learn the filters directly from examples using deep convolutional neural networks. To make training tractable, we carefully design a synthetic dataset that captures small motion well, and use two-frame input for training. We show that the learned filters achieve high-quality results on real videos, with less ringing artifacts and better noise characteristics than previous methods. While our model is not trained with temporal filters, we found that the temporal filters can be used with our extracted representations up to a moderate magnification, enabling a frequency-based motion selection. Finally, we analyze the learned filters and show that they behave similarly to the derivative filters used in previous works. Our code, trained model, and datasets will be available online.
科研通智能强力驱动
Strongly Powered by AbleSci AI