声源定位
互相关
混响
相关性
计算机科学
欧几里德距离
分类器(UML)
麦克风阵列
话筒
朴素贝叶斯分类器
语音识别
测距
人工智能
模式识别(心理学)
支持向量机
算法
声学
数学
声音(地理)
物理
电信
统计
声压
几何学
作者
Xinwang Wan,Zhenyang Wu
标识
DOI:10.1016/j.apacoust.2012.06.006
摘要
Sound source localization plays a crucial role in many microphone arrays application, ranging from speech enhancement to human–computer interface in a reverberant noisy environment. The steered response power (SRP) using the phase transform (SRP-PHAT) method is one of the most popular modern localization algorithms. The SRP-based source localizers have been proved robust, however, the methods may fail to locate the sound source in adverse noise and reverberation conditions, especially when the direct paths to the microphones are unavailable. This paper proposes a localization algorithm based on discrimination of cross-correlation functions. The cross-correlation functions are calculated by the generalized cross-correlation phase transform (GCC-PHAT) method. Using cross-correlation functions, sound source location is estimated by one of the two classifiers: Naive-Bayes classifier and Euclidean distance classifier. Simulation results have demonstrated that the proposed algorithms provide higher localization accuracy than the SRP-PHAT algorithm in reverberant noisy environment.
科研通智能强力驱动
Strongly Powered by AbleSci AI