隐藏字幕
计算机科学
多媒体
人工智能
图像(数学)
作者
Abhinav Kumar,Rejo Mathew
出处
期刊:Social Science Research Network
[Social Science Electronic Publishing]
日期:2020-01-01
摘要
Deep Learning (DL) has gained huge popularity due to its efficiency and its great performance on solving some specific problem regarding to Artificial Intelligence. Also substantial progress has been made in the field of Computer Vision (CV) and Natural Language Processing (NLP). The computer vision market is expected to grow from USD 10.9 billion in 2019 to USD 17.4 billion by 2024-growing at a CAGR of 7.8% during the forecast period. Because of this, the task of video captioning has emerged and it is believed that the mixture of DL, CV, NLP can provide great results. Video captioning refers to the process of automatic generation of natural language sentences that describes the contents of a given video. This paper talks about techniques like CNN, RNN, LSTM, 3D CNN, etc which are used to solve various problems.
科研通智能强力驱动
Strongly Powered by AbleSci AI