计算机科学
特征(语言学)
奇点
模式识别(心理学)
人工智能
Mel倒谱
一般化
倒谱
特征提取
小波
语音识别
数学
语言学
数学分析
哲学
作者
Kanghao Zhang,Liang Shan,Shuai Nie,Shulin He,Jiahui Pan,Xueliang Zhang,Haoxin Ma,Jiangyan Yi
标识
DOI:10.1109/icassp43922.2022.9746596
摘要
There are many methods for detecting forged audio produced by conversion and synthesis. However, as a simpler method of forgery, splicing has not attracted widespread attention. Based on the characteristic that the tampering operation will cause singularities at high-frequency components, we propose a high-frequency singularity detection feature obtained by wavelet transform. The proposed feature can explicitly show the location of the tampering operation on the waveform. Moreover, the long short-term memory (LSTM) is introduced to the CNN-architecture LCNN to ensure that the sequence information can be fully learned. The proposed feature is sent to the improved RNN-architecture LCNN together with the widely used linear frequency cepstral coefficients (LFCC) to learn forgery characteristics where the LFCC is used as a supplement. Systematic evaluation and comparison show that the proposed method has greatly improved the accuracy and generalization.
科研通智能强力驱动
Strongly Powered by AbleSci AI