计算机科学
自动汇总
人工智能
卷积神经网络
水准点(测量)
特征提取
特征(语言学)
帧(网络)
计算机视觉
视频处理
RGB颜色模型
深度学习
分割
模式识别(心理学)
弹丸
作者
Xuming Feng,Yaping Zhu,Cheng Yang
标识
DOI:10.1109/ic-nidc54101.2021.9660579
摘要
Video summarization is a technique that creates short summaries from original videos while retaining the main representative information. Traditional video summarization models based on deep learning mostly use frames as the basic processing unit, which cannot handle long videos due to hardware limitations. In this paper, we compress the frame-level features into shot-level features using a feature extractor based on Convolutional Neural Network (CNN), which can improve the training accuracy and reduce computation. At the same time, we propose a feature fusion algorithm based on the capsule network, which combines the RGB features and Light Flow features of the video into the deep features with adaptive weights to enhance the original video features. Experiment results on two benchmark datasets (TVsum and SumMe) validate the effectiveness of our method.
科研通智能强力驱动
Strongly Powered by AbleSci AI