卷积神经网络
机制(生物学)
主管(地质)
计算机科学
声音(地理)
人工神经网络
人工智能
声学
地质学
物理
量子力学
地貌学
作者
Liang Cong,Qian Chen,Qiran Li,Qingnan Wang,Kang Zhao,Jihui Tu,Ammar Jafaripournimchahi
出处
期刊:Electronics
[MDPI AG]
日期:2024-10-28
卷期号:13 (21): 4229-4229
标识
DOI:10.3390/electronics13214229
摘要
Video surveillance is an effective tool for traffic management and safety, but it may face challenges in extreme weather, low visibility, areas outside the monitoring field of view, or during nighttime conditions. Therefore, abnormal sound detection is used in traffic management and safety as an auxiliary tool to complement video surveillance. In this paper, a novel lightweight method for abnormal sound detection based on 1D CNN and Multi-Head Self-Attention Mechanism on the embedded system is proposed, which is named HADNet. First, 1D CNN is employed for local feature extraction, which minimizes information loss from the audio signal during time-frequency conversion and reduces computational complexity. Second, the proposed block based on Multi-Head Self-Attention Mechanism not only effectively mitigates the issue of disappearing gradients, but also enhances detection accuracy. Finally, the joint loss function is employed to detect abnormal audio. This choice helps address issues related to unbalanced training data and class overlap, thereby improving model performance on imbalanced datasets. The proposed HADNet method was evaluated on the MIVIA Road Events and UrbanSound8K datasets. The results demonstrate that the proposed method for abnormal audio detection on embedded systems achieves high accuracy of 99.6% and an efficient detection time of 0.06 s. This approach proves to be robust and suitable for practical applications in traffic management and safety. By addressing the challenges posed by traditional video surveillance methods, HADNet offers a valuable and complementary solution for enhancing safety measures in diverse traffic conditions.
科研通智能强力驱动
Strongly Powered by AbleSci AI