计算机科学
去相关
话筒
扬声器
Echo(通信协议)
语音识别
麦克风阵列
一般化
频道(广播)
多输入多输出
声学
算法
电信
数学
计算机网络
数学分析
物理
作者
Chenggang Zhang,Jinjiang Liu,Hao Li,Xueliang Zhang
标识
DOI:10.1109/taslp.2023.3282103
摘要
Deep learning is introduced in multi-channel (MC) and multi-microphone (MM) acoustic echo cancellation (AEC) without decorrelation to the loudspeaker signals and achieves remarkable performance. In this paper, we propose a complex spectral mapping framework with inplace convolution and frequency-wise temporal modeling for MCAEC problem, which efficiently models the echo paths and spatial information. The proposed method is a multi-input and multi-output (MIMO) scheme, which filters out echoes from all microphone signals simultaneously, so the computational cost is greatly reduced. In addition, a cross-domain loss function with a multi-task learning strategy is designed for better generalization capability. Experiments are conducted on various unmatched scenarios and results show that the proposed method significantly outperforms previous methods. Moreover, a lightweight version of the proposed model with 0.29 million trainable parameters also shows good performance, which is essential for resource-limited and real-time applications.
科研通智能强力驱动
Strongly Powered by AbleSci AI