欺骗攻击
计算机科学
语音识别
探测器
语音活动检测
分类器(UML)
说话人验证
人工智能
分拆(数论)
模式识别(心理学)
语音处理
说话人识别
数学
计算机安全
电信
组合数学
作者
Chengwei Wei,Runqi Pang,C.‐C. Jay Kuo
标识
DOI:10.1109/icassp48485.2024.10448336
摘要
A green-learning-based spoofed speech detector that decides whether an input speech sample is bona fide (genuine) or spoofed in an automatic speaker verification (ASV) system, is proposed in this work. The proposed solution, called the green ASVspoof detector (GAD), adopts Wav2vec (version 2.0) speech representations as its front-end model. We partition an input speech sample into temporal segments and adopt the Wav2vec representation for each segment. Then, GAD is a 3-stage decision process comprising one XGBoost classifier in each stage. It offers an interpretable design. It is shown by experimental results that GAD achieves competitive performance in ASVspoof detection. At the same time, it has a smaller model size and significantly lower computational complexity, thus positioning it as an effective and green solution.
科研通智能强力驱动
Strongly Powered by AbleSci AI