计算机科学
光谱图
稳健性(进化)
互操作性
生物识别
卷积神经网络
说话人识别
电话
语音识别
人工智能
生物化学
化学
基因
操作系统
语言学
哲学
作者
Lázaro J. González‐Soler,Marta Gomez‐Barrero,Madhu R. Kamble,Massimiliano Todisco,Christoph Busch
标识
DOI:10.1109/iwbf55382.2022.9794518
摘要
Improving the robustness of biometric systems to external attacks is of the utmost importance for the research community. In particular, Automatic Speaker Verification (ASV) can be easily bypassed by launching either attack presentations (i.e., physical access attacks) over the capture devices (i.e., micro-phone) or exchanging the input sample in the channel between the capture device and the signal processor (i.e., logical access attacks). In order to address these security threats, ASVspoof challenges have evaluated the generalisation ability of several Presentation Attack Detection (PAD) approaches in the last decade. Those algorithms have reported a remarkable detection performance to detect physical and logical access attacks when they are combined with the decision provided by the ASV systems. They fundamentally depend upon the complementary information of ASV systems for a reliable detection performance. Therefore, they are not interoperable across different systems. In this work, we propose an interoperable dual-stream PAD method which leverages temporal information from image-based voice spectrograms to enhance generalisation on PAD. The experimental results conducted over the publicly available ASVspoof 2019 and 2021 databases show the feasibility of our approach to detect both physical and logical access attacks unknown in training.
科研通智能强力驱动
Strongly Powered by AbleSci AI