计算机科学
语音增强
噪音(视频)
语音识别
话语
人工神经网络
背景噪声
深层神经网络
语音活动检测
人工智能
模式识别(心理学)
语音处理
降噪
电信
图像(数学)
作者
Wenbo Wang,Houguang Liu,Jianhua Yang,Guohua Cao,Chunli Hua
标识
DOI:10.1142/s0217984919501884
摘要
Deep neural network (DNN) has recently been successfully adopted as a regression model in speech enhancement. Nonetheless, training machines to adapt different noise is a challenging task. Because every noise has its own characteristics which can be combined with speech utterance to give huge variation on which the model has to operate on. Thus, a joint framework combining noise classification (NC) and speech enhancement using DNN was proposed. We first determined the noise type of contaminated speech by the voice activity detection (VAD)-DNN and the NC-DNN. Then based on the noise classification results, the corresponding SE-DNN model was applied to enhance the contaminated speech. In addition, in order to make method simpler, the structure of different DNNs was similar and the features were the same. Experimental results show that the proposed method effectively improved the performance of speech enhancement in complex noise environments. Besides, the accuracy of classification had a great influence on speech enhancement.
科研通智能强力驱动
Strongly Powered by AbleSci AI