计算机科学
混响
阶段(地层学)
语音识别
接头(建筑物)
时域
人工神经网络
频域
可理解性(哲学)
噪音(视频)
网络体系结构
领域(数学分析)
干扰(通信)
人工智能
电信
计算机网络
声学
工程类
频道(广播)
数学
物理
图像(数学)
数学分析
哲学
认识论
古生物学
计算机视觉
建筑工程
生物
作者
Junjie Xia,Hongqing Liu,Yi Zhou,Zhen Luo
标识
DOI:10.1109/iccc54389.2021.9674288
摘要
Our auditory experience is constantly disturbed by background noise and indoor reverberation in the actual speech environment, seriously damaging speech intelligibility and quality. In the past studies, people have proposed a two-stage deep neural network based on frequency domain to eliminate the above interference, and they suffer from some limitations, resulting in the upper limit of its performance. This paper proposes an end-to-end two-stage deep neural network in the time domain, eliminating noise in the first stage and reverberation in the second stage. First of all, we train the two-stage network separately and separate training parameters as the initial values for the two-stage network joint training. Compared with single-stage network and two-stage frequency domain network, the proposed two-stage time domain network presents better performance.
科研通智能强力驱动
Strongly Powered by AbleSci AI