隐藏字幕
计算机科学
变压器
人工智能
生成语法
机器学习
建筑
任务(项目管理)
数据科学
图像(数学)
工程类
艺术
视觉艺术
系统工程
电压
电气工程
作者
Uddagiri Sirisha,B. Sai Chandana
标识
DOI:10.1109/icssit55814.2023.10061116
摘要
Data and computer science advancements in recent years have greatly benefited people’s day-to-day lives. On the other hand, criminals are embracing new technologies to bolster and broaden their operations. There is a great deal of promise in applying the Deep Learning (DL) paradigm to the analysis of highly structured data. However, the availability of public datasets in the crime detection area is low and task-specific, making it difficult to study and develop DL-assisted solutions that are both accurate and robust. This study aims to adapt the popular UCF-crime dataset for use with video subtitling and propose a hybrid model GITAAR (Generative Image-totext Transformer for abnormal activity recognition), a new transformer-architecture for video-caption-generation. In this paper, UCF crime dataset that compares a recently suggested video captioning system against a large number of state-of-theart methods, describing both the qualitative and quantitative aspects.
科研通智能强力驱动
Strongly Powered by AbleSci AI