计算机科学
波束赋形
人工智能
自适应波束形成器
模式识别(心理学)
稳健性(进化)
算法
噪音(视频)
人工神经网络
估计员
计算机视觉
作者
Chengyun Deng,Hui Song,Yi Zhang,Sha Yongtao,Xiangang Li
出处
期刊:International Conference on Acoustics, Speech, and Signal Processing
日期:2020-05-04
卷期号:: 4647-4651
被引量:1
标识
DOI:10.1109/icassp40776.2020.9054239
摘要
Spectral mask based beamforming has showed competitive performance on multi-channel speech enhancement in recent years. However, such methods apply mask estimation on each channel and ensemble the masks from multiple channels into one for speech and noise covariance estimation. Spectral-spatial mask estimation has not been well extended yet. In this paper, we propose a novel spectral-spatial mask based beamforming method for two-channel noisy signals, where spectral amplitude and cross-channel spatial features are integrated to improve mask estimation. Multi-channel masks are not merged in order to preserve channel characteristics for robust beamforming. Furthermore, this two-channel method is extended to six-channel scenario. Experiments on CHiME3 evaluation confirm the superior performance of the proposed method over two spectral mask estimation approaches in terms of word error rates (WER) improvement.
科研通智能强力驱动
Strongly Powered by AbleSci AI