计算机科学
任务(项目管理)
适应(眼睛)
语音学
语音识别
自然语言处理
人工智能
语言学
心理学
哲学
管理
神经科学
经济
作者
Y. Liu,Yue Zhao,Xiaona Xu,Liang Xu,Xubei Zhang,Qiang Ji
标识
DOI:10.1186/s13636-024-00361-7
摘要
Abstract The disparities in phonetics and corpuses across the three major dialects of Tibetan exacerbate the difficulty of a single task model for one dialect to accommodate other different dialects. To address this issue, this paper proposes task-diverse meta-learning. Our model can acquire more comprehensive and robust features, facilitating its adaptation to the variations among different dialects. This study uses Tibetan dialect ID recognition and Tibetan speaker recognition as the source tasks for meta-learning, which aims to augment the ability of the model to discriminate variations and differences among different dialects. Consequently, the model’s performance in Tibetan multi-dialect speech recognition tasks is enhanced. The experimental results show that task-diverse meta-learning leads to improved performance in Tibetan multi-dialect speech recognition. This demonstrates the effectiveness and applicability of task-diverse meta-learning, thereby contributing to the advancement of speech recognition techniques in multi-dialect environments.
科研通智能强力驱动
Strongly Powered by AbleSci AI