听觉场景分析
立体声录音
噪音(视频)
分离(统计)
作者
Yuzhou Liu,Masood Delfarah,DeLiang Wang
出处
期刊:International Conference on Acoustics, Speech, and Signal Processing
日期:2020-05-01
被引量:11
标识
DOI:10.1109/icassp40776.2020.9054572
摘要
Monaural speech separation is the task of separating target speech from interference in single-channel recordings. Although substantial progress has been made recently in deep learning based speech separation, previous studies usually focus on a single type of interference, either background noise or competing speakers. In this study, we address both speech and nonspeech interference, i.e., monaural speaker separation in noise, in a talker-independent fashion. We extend a recently proposed deep CASA system to deal with noisy speaker mixtures. To facilitate speech enhancement, a denoising module is added to deep CASA as a front-end processor. The proposed systems achieve state-of-the-art results on a benchmark noisy two-speaker separation dataset. The denoising module leads to substantial performance gain across various noise types, and even better generalization in noise-free conditions.
科研通智能强力驱动
Strongly Powered by AbleSci AI