自编码
残余物
计算机科学
异常检测
人工智能
水准点(测量)
模式识别(心理学)
特征(语言学)
光流
卷积神经网络
深度学习
计算机视觉
图像(数学)
算法
语言学
哲学
大地测量学
地理
作者
Rangachary Kommanduri,Mrinmoy Ghorai
标识
DOI:10.1016/j.jvcir.2023.103860
摘要
Video anomaly detection (VAD) refers to identifying abnormal events in the surveillance video. Typically, reconstruction based video anomaly detection techniques employ convolutional autoencoders with a limited number of layers, which extracts insufficient features leading to improper network training. To address this challenge, an end-to-end unsupervised feature enhancement network, namely Bi-Residual Convolutional AutoEncoder (Bi-ResCAE) has been proposed that can learn normal events with low reconstruction error and detect anomalies with high reconstruction error. The proposed Bi-ResCAE network incorporates long–short residual connections to enhance feature reusability and training stabilization. In addition, we propose to formulate a novel VAD model that can extract appearance and motion features by fusing both the Bi-ResCAE network and optical flow network in the objective function to recognize the anomalous object in the video. Extensive experiments on three benchmark datasets validate the effectiveness of the model. The proposed model achieves an AUC (Area Under the ROC Curve) of 84.7% on Ped1, 97.7% on Ped2, and 86.71% on the Avenue dataset. The results show that the Bi-READ performs better than state-of-the-art techniques.
科研通智能强力驱动
Strongly Powered by AbleSci AI