光谱图
计算机科学
噪音(视频)
降噪
语音识别
语音增强
人工神经网络
过程(计算)
深度学习
音频信号
噪声测量
音质
音频信号处理
人工智能
语音编码
图像(数学)
操作系统
作者
S. Jassem Mohammed,N. Radhika
出处
期刊:Lecture notes on data engineering and communications technologies
日期:2022-01-01
卷期号:: 33-47
被引量:1
标识
DOI:10.1007/978-981-16-7610-9_3
摘要
Improving speech quality is becoming a basic requirement with increasing interest in speech processing applications. A lot of speech enhancement techniques are developed to reduce or completely remove listeners fatigue from various devices like smartphones and also from online communication applications. Background noise often interrupts communication, and this was solved using a hardware physical device that normally emits a negative frequency of the incoming audio noise signal to cancel out the noise. Deep learning has recently made a break-through in the speech enhancement process. This paper proposes an audio denoising model which is built on a deep neural network architecture based on spectrograms (which is a hybrid between frequency domain and time domain). The proposed deep neural network model effectively predicts the negative noise frequency for given input incoming audio file with noise. After prediction, the predicted values are then removed from the original noise audio file to create the denoised audio output.
科研通智能强力驱动
Strongly Powered by AbleSci AI