发布文献求助

CACRN-Net: A 3D log Mel spectrogram based channel attention convolutional recurrent neural network for few-shot speaker identification

过度拟合光谱图计算机科学卷积神经网络语音识别鉴定（生物学）频道（广播）人工智能任务（项目管理）说话人识别说话人日记深度学习说话人识别模式识别（心理学）人工神经网络工程类植物生物计算机网络系统工程

作者

Banala Saritha,Mohammad Azharuddin Laskar,Anish Monsley Kirupakaran,Rabul Hussain Laskar,Madhuchhanda Choudhury

出处

期刊：Computers & Electrical Engineering [Elsevier]
日期：2024-01-31 卷期号：115: 109100-109100 被引量：1

标识

DOI：10.1016/j.compeleceng.2024.109100

摘要

Advancements in deep learning for speaker identification are constrained by the limited availability of data, especially in law enforcement applications. This has led to the emergence of few-shot speaker identification, a technique that classifies unseen test samples with the help of a few support samples. Despite several attempts to advance few-shot speaker identification, significant challenges persist, including the extraction of robust speaker embeddings, the problem of overfitting, and the issue of prototype shift error. This paper proposes a few-shot speaker identification system employing a novel architecture called the Channel Attention-based Convolutional Recurrent Neural Network (CACRN-Net) with three-dimensional (3D) log Mel spectrogram inputs to mitigate overfitting and enhance the accuracy of speaker embeddings. Furthermore, a self-attention mechanism alleviates prototype shift errors caused by noisy data. The proposed framework is compared to existing methods using VCTK and Voxceleb1 speech corpora through 5-way, 5-shot learning experiments. To assess the performance of the framework in speech variability conditions, we utilized the IIT Guwahati (IITG) multi-variability (MV) speech database. The proposed approach outperforms state-of-the-art techniques, achieving a substantial enhancement in speaker identification with a 2.73 % accuracy improvement on the VCTK database and a 2.3 % improvement on Voxceleb1.

求助该文献

科研通智能强力驱动
Strongly Powered by AbleSci AI

我的文献求助列表浏览历史

一分钟了解求助规则 | 捐赠本站 | 历史今天

更新

2024年影响因子查询已上线 (2024-6-20)

更新

大幅提高文件上传限制，最高150M (2024-4-1)

科研通是完全免费的文献互助平台，具备全网最快的应助速度，最高的求助完成率。对每一个文献求助，科研通都将尽心尽力，给求助人一个满意的交代。

实时播报: ljh发布了新的文献求助10

1秒前; 从容芮上传了应助文件

1秒前; Alan完成签到，获得积分10

1秒前; 小屋给小屋的求助进行了留言

2秒前; 赘婿的应助被laura采纳，获得10

3秒前; 打打的应助被02Zhu采纳，获得10

4秒前; xwc完成签到，获得积分10

6秒前; 斯文败类的应助被青年才俊采纳，获得10

6秒前; 子车茗的应助被青年才俊采纳，获得10

6秒前; 赘婿的应助被青年才俊采纳，获得10

6秒前; 薰硝壤的应助被青年才俊采纳，获得10

6秒前; 薰硝壤的应助被青年才俊采纳，获得10

6秒前; 子车茗的应助被青年才俊采纳，获得10

7秒前; 在水一方的应助被青年才俊采纳，获得10

7秒前; ming的应助被青年才俊采纳，获得10

7秒前; 叶子发布了新的文献求助10

7秒前; Jack_Xue发布了新的文献求助30

7秒前; 赘婿上传了应助文件

10秒前; 从容芮上传了应助文件

11秒前; 科研通AI2.0上传了应助文件

12秒前; 乐乐的应助被ljh采纳，获得10

12秒前; 科研通AI2.0上传了应助文件

13秒前; 布鲁盖完成签到，获得积分10

13秒前; MeSs完成签到，获得积分10

14秒前; 大阳阳发布了新的文献求助10

14秒前; CipherSage上传了应助文件

15秒前; lyric上传了应助文件

15秒前; BEST完成签到，获得积分10

15秒前; Owen上传了应助文件

15秒前; jy完成签到，获得积分10

16秒前; 科研通AI2.0上传了应助文件

16秒前; 布鲁盖发布了新的文献求助10

16秒前; Pessimist完成签到，获得积分0

17秒前; 今后上传了应助文件

18秒前; 从容芮上传了应助文件

18秒前; 科研通AI2.0上传了应助文件

19秒前; Leon发布了新的文献求助10

19秒前; SciGPT的应助被噗噗采纳，获得10

19秒前; feifeifei发布了新的文献求助10

20秒前; 桐桐上传了应助文件

20秒前

高分求助中: Impact of Mitophagy-Related Genes on the Diagnosis and Development of Esophageal Squamous Cell Carcinoma via Single-Cell RNA-seq Analysis and Machine Learning Algorithms 2000; Die Elektra-Partitur von Richard Strauss : ein Lehrbuch für die Technik der dramatischen Komposition 1000; How to Create Beauty: De Lairesse on the Theory and Practice of Making Art 1000; Gerard de Lairesse : an artist between stage and studio 670; 大平正芳: 「戦後保守」とは何か 550; LNG地下タンク躯体の構造性能照査指針 500; Cathodoluminescence and its Application to Geoscience 500

热门求助领域（近24小时）

热门帖子: 关注科研通微信公众号，转发送积分 3000699; 求助须知：如何正确求助？哪些是违规求助？ 2660589; 关于积分的说明 7205732; 捐赠科研通 2296440; 什么是DOI，文献DOI怎么找？ 1217683; 科研通“疑难数据库（出版商）”最低求助积分说明 593864; 版权声明 592943

今日热心研友

科研文献搬运工

坚强的广山

热心市民小红花

注：热心度 = 本日应助数 + 本日被采纳获取积分÷10

Copyright © 2020-2024 AbleSci.COM, 科研通, All Right Reserved

科研通是非营利科研互助平台，不忘初心，为科研助力

本站互助的所有文件仅供个人学习研究用，禁止任何人把求助的所得文献进行盈利或传播

皖ICP备2024041134号-1

皖公网安备34019202002308

科研通【文献互助QQ群】：如果您有特殊求助，或发布求助超过24小时未得到应助，可加群求助，群号：826996720【点击一键加群】

科研通【志愿服务QQ群】：如果您热爱文献互助，有热心愿意为更多人服务，请加入小伙伴群，点击申请加入

关注微信服务号

科研通