光谱图
计算机科学
人工智能
特征(语言学)
语音识别
计算机视觉
频道(广播)
噪音(视频)
音频信号处理
模式识别(心理学)
音频信号
语音编码
图像(数学)
计算机网络
哲学
语言学
作者
Güzin Ulutaş,Gül Tahaoğlu,Beste Üstübioğlu
标识
DOI:10.1109/tsp55681.2022.9851327
摘要
Audio copy-move-forgery audio is one of the most popular methods in the field of audio forensic. This type of forgery is created by copying one or more audio segments and pasting it in another position within the same audio. In this study, for detection of the audio copy-move forgery, a new method using a keypoint-based scheme on the Mel spectrogram model of audio is presented. Firstly, Mel spectrogram image is generated from the suspicious audio. Then, SURF keypoints are obtained from each RBG color channel of Mel spectrogram image. The obtained keypoints from each channel are matched via feature vectors to reveal whether the audio file is forged or original. Finally, the proposed post-processing step is applied to eliminate possible false matches. In the method, providing sufficient final matched keypoints according to the threshold value of the number of matches which is determined by experimental studies reveals that the audio file is forged. Experimental studies are carried out on publicly available the pitch-based dataset. The performance results prove that the proposed method is more robust against even under post-processing operations like noise addition, filtering operation, and compression operation.
科研通智能强力驱动
Strongly Powered by AbleSci AI