计算机科学
规范化(社会学)
自相关
短时傅里叶变换
熵(时间箭头)
数据挖掘
语音识别
人工智能
傅里叶变换
模式识别(心理学)
傅里叶分析
数学
统计
数学分析
物理
量子力学
社会学
人类学
作者
Hossein Parineh,Majid Sarvi,Saeed Asadi Bagloee
出处
期刊:Measurement
[Elsevier]
日期:2023-12-01
卷期号:223: 113784-113784
标识
DOI:10.1016/j.measurement.2023.113784
摘要
Emergency Vehicle Detection (EVD) is a critical task in transportation management. Vehicle audio analysis provides affordable and insightful data, compared to established yet expensive vehicle detection methods. This research presents two contributions. Firstly, to decrease the processing power and memory demands, it offers a method to detect the optimal maximum sampling frequencies through an analysis of correlation between spectral entropy (SE) and dataset variance, grounded in the principles of the discrete Fourier transform (DFT) and Nyquist–Shannon theorem. Secondly, to address the inconsistency between recorded data in different datasets, this paper proposes segmenting audio into small input time windows (ITW) and applying individualized normalization. Training the model on dataset-1 achieves 98.37% accuracy on the validation set, with 86.83% accuracy on dataset-2. Comparative analysis with the baseline Long-Short-Term Memory model shows 22.79% performance improvement in favor of the proposed 1D-CNN model. Overall, this model outperforms state-of-the-art techniques, achieving accuracy rate of 98.37%.
科研通智能强力驱动
Strongly Powered by AbleSci AI