计算机科学
波束赋形
混响
语音增强
卡尔曼滤波器
算法
级联
语音识别
计算复杂性理论
滤波器(信号处理)
线性预测
降噪
传递函数
噪音(视频)
人工智能
电信
声学
工程类
计算机视觉
电气工程
物理
图像(数学)
化学工程
作者
Sahar Hashemgeloogerdi,Sebastian Braun
标识
DOI:10.1109/icassp40776.2020.9053785
摘要
The performance of speech processing systems degrades significantly in far-field scenarios where the distance between the user and microphones increases, leading to low signal-to-noise and signal-to-reverberation ratios. To address this challenge, combining the denoising and dereverberation techniques in both parallel and cascade configurations has been widely studied. However, a parallel or cascade combination may not be efficient while imposing a large computational complexity. We propose a constrained Kalman filter based multichannel linear prediction method to jointly perform denoising and dereverberation efficiently using an online processing algorithm. In contrast to previously proposed methods which utilize steering vectors based on the relative early transfer function, our algorithm is implemented using a direct relative transfer function based steering vector, which aims at extracting the direct sound as opposed to preserving the early reflections. We show that the proposed algorithm outperforms existing online implementations of integrated beamformer and linear prediction methods on the REVERB challenge speech enhancement task while being computationally less complex.
科研通智能强力驱动
Strongly Powered by AbleSci AI