计算机科学
语音识别
资源(消歧)
人工智能
自然语言处理
计算机网络
作者
Dhanva Eledath,Arun Baby,Shatrughan Singh
标识
DOI:10.1109/ncc60321.2024.10485786
摘要
Robust accented speech recognition is a challenging task in the field of automatic speech recognition (ASR). Accurate recognition of low-resource accents can significantly improve the performance of speech-based systems in various applications such as virtual assistants, communication devices, and language learning tools. However, ASR models often struggle to accurately recognize these accents due to their variability in pronunciation and language use. The state-of-the-art conformer transducer model for ASR is trained with the help of model-agnostic meta-learning to improve the performance of the system across different accents of English in this work. An improvement of about 12 % relative word error rate is achieved using a publicly available dataset for most of the low-resource accents.
科研通智能强力驱动
Strongly Powered by AbleSci AI