八分之一(仪器)
方位角
球谐函数
算法
计算机科学
背景(考古学)
谐波
数学
模式识别(心理学)
人工智能
物理
几何学
数学分析
光学
地质学
古生物学
量子力学
电压
作者
Priyadarshini Dwivedi,Gyanajyoti Routray,Rajesh M. Hegde
标识
DOI:10.1109/tai.2024.3352530
摘要
Recent advancements in artificial intelligence (AI) have shown potential solutions to acoustic source localization in three-dimensional space. This paper proposes a new low-complex AI-based framework in the spherical harmonics (SH) domain for efficient DOA estimation. The SH coefficients are the key features for the DOA estimation and are obtained from the SH decomposition (SHD) of the spherical microphone array (SMA) recordings. Subsequently, the unified convolutional neural network (UCNN) model is trained to estimate the source azimuth and elevation from the phase and magnitude of the SH coefficient. Since the relation between the azimuth and elevation with phase and magnitude of SH coefficient is surjective. The accuracy of the training model is highly influenced by the volume of training data. In this context, the symmetric properties of the SH basis function are explored to obtain the spherical harmonics implicit symmetric coefficients (SH-ISC) that split the 3D space into octant classes. Within each octant, the phase and magnitude of the SH coefficients exhibit one-to-one correspondence with the source azimuth and elevation and execute the data redundancy. This work can be divided into two parts, a multi-class support vector machine (M-SVM) is investigated to obtain the octant classes from the SH-ISC in the first part. In the second part, the UCNN model is developed to estimate the DOA angles in each octant class. Further, the proposed technique is computationally efficient compared to the baseline learning algorithms in terms of sample and run-time complexity.
科研通智能强力驱动
Strongly Powered by AbleSci AI