Audio–visual representation learning for anomaly events detection in crowds

人群 异常检测 计算机科学 代表(政治) 人工智能 视听 异常(物理) 特征学习 模式识别(心理学) 多媒体 计算机安全 物理 政治 政治学 法学 凝聚态物理
作者
Junyu Gao,Hao Yang,Maoguo Gong,Xuelong Li
出处
期刊:Neurocomputing [Elsevier BV]
卷期号:582: 127489-127489 被引量:17
标识
DOI:10.1016/j.neucom.2024.127489
摘要

In recent years, anomaly events detection in crowd scenes attracts many researchers' attentions, because of its importance to public safety. Existing methods usually exploit visual information to analyze whether any abnormal events have occurred due to only visual sensors are generally equipped in public places. However, when an abnormal event in crowds occurs, sound information may be discriminative to assist the crowd analysis system to determine whether there is an abnormality. Compared with vision information that is easily occluded, audio signals have a certain degree of penetration. Thus, this paper attempt to exploit multi-modal learning for modeling the audio and visual signals simultaneously. To be specific, we design a two-branch network to model different types of information. The first is a typical 3D CNN model to extract temporal appearance feature from video clips. The second is an audio CNN for encoding Log Mel-Spectrogram of audio signals. Finally, by fusing the above features, the more accurate prediction will be produced. We conduct the experiments on SHADE dataset, a synthetic audio–visual dataset in surveillance scenes, and find introducing audio signals effectively improves the performance of anomaly events detection and outperforms other state-of-the-art methods. Furthermore, we will release the code and the pre-trained models as soon as possible.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
1秒前
1秒前
shirley应助碧蓝柠檬采纳,获得10
4秒前
爆米花应助Hongmin采纳,获得10
4秒前
5秒前
corleeang发布了新的文献求助10
5秒前
Lucas应助fang采纳,获得10
5秒前
郑启完成签到 ,获得积分10
6秒前
一口完成签到,获得积分10
8秒前
周树人发布了新的文献求助10
8秒前
188发布了新的文献求助10
9秒前
9秒前
9秒前
冯紫淇完成签到,获得积分10
10秒前
10秒前
外向白竹完成签到,获得积分10
11秒前
12秒前
Jia发布了新的文献求助10
13秒前
鲤鱼小蕾完成签到,获得积分10
14秒前
14秒前
韩豆豆发布了新的文献求助10
14秒前
小巍澜发布了新的文献求助10
15秒前
孟陬二四发布了新的文献求助100
17秒前
fang完成签到,获得积分10
19秒前
星辰大海应助Jia采纳,获得10
19秒前
czxchase发布了新的文献求助10
20秒前
zealke发布了新的文献求助10
21秒前
星辰大海应助kying采纳,获得10
24秒前
烟花应助任苏志采纳,获得10
24秒前
Terry完成签到,获得积分10
25秒前
188完成签到,获得积分10
25秒前
科研通AI2S应助缓慢如南采纳,获得10
29秒前
不浪漫罪名完成签到 ,获得积分10
31秒前
韩豆豆完成签到,获得积分10
31秒前
小贱鱼应助哈哈侠采纳,获得10
33秒前
34秒前
CodeCraft应助天空采纳,获得10
35秒前
沁虹完成签到,获得积分10
35秒前
36秒前
37秒前
高分求助中
Introduction to Helicopter and Tiltrotor Flight Simulation, Second Edition 2000
Overcoming Stigma and Bias in Obesity Management 1200
Malcolm Fraser : a biography 700
Signals, Systems, and Signal Processing 610
Bounds for Statistical Estimation in Semiparametric Models 500
Forced degradation and stability indicating LC method for Letrozole: A stress testing guide 500
Ideology and Meaning-Making under the Putin Regime 450
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6488544
求助须知:如何正确求助?哪些是违规求助? 8287008
关于积分的说明 17678815
捐赠科研通 5578133
什么是DOI,文献DOI怎么找? 2914079
邀请新用户注册赠送积分活动 1891141
关于科研通互助平台的介绍 1748644