单声道
小波
计算机科学
小波变换
傅里叶变换
估计
人工智能
语音识别
电子工程
工程类
计算机视觉
数学
系统工程
数学分析
作者
Hossein Parineh,Majid Sarvi,Saeed Asadi Bagloee
出处
期刊:IEEE Sensors Journal
[Institute of Electrical and Electronics Engineers]
日期:2024-03-29
卷期号:24 (9): 15337-15346
被引量:1
标识
DOI:10.1109/jsen.2024.3381276
摘要
Vehicle speed estimation using acoustic data offers a cost-effective and non-intrusive approach to enhance traffic safety and efficiency. Previous studies are limited by controlled settings, constraints on vehicle types, and reliance on manual tuning. This paper proposes an end-to-end framework for monaural settings in a multi-lane roadway, subject to real-life ambient noise without restriction on vehicle types. First, we develop a multimodal feature vector by processing raw audio data using a hybrid Fourier-Wavelet method. Second, a careful examination of the ambient noise and vehicular audio guarantees the validity of the suggested feature vector. Third, we design two deep neural networks to handle both regression and classification tasks. Our method is evaluated against state-of-the-art on a benchmark dataset comprising 304 samples. The results demonstrate a substantial improvement in accuracy, increasing by 29.4% (achieving 83.26% accuracy for the target class), and enhancement in the root mean square error (RMSE) for the regression task by 5.6%. In addition, we provide a proprietary dataset collected and curated as part of this study in four urban locations in Melbourne, Australia. This dataset represents the first real-world compilation of complex acoustic speed data, comprising 364 samples. Subsequent tests yield a classification accuracy of 84.02%, as well as Mean Average Error (MAE) and Root Mean Square Error (RMSE) values of 6.38 and 8.09 for regression, respectively.
科研通智能强力驱动
Strongly Powered by AbleSci AI