变压器
计算机科学
语音识别
任务(项目管理)
自然语言处理
工程类
电气工程
电压
系统工程
作者
Zhengjia Dan,Yue Zhao,Xiaojun Bi,Licheng Wu,Qiang Ji
标识
DOI:10.1007/978-3-031-17120-8_9
摘要
In this paper, we apply multi-task learning to perform low-resource multi-dialect speech recognition, and propose a method combining Transformer and soft parameter sharing multi-task learning. Our model has two task streams: the primary task stream that recognizes speech and the auxiliary task stream identifies the dialect. The auxiliary task stream provides the dialect identification information to the auxiliary cross-attention of the primary task stream, so that the primary task stream has dialect discrimination. Experimental results on the task of Tibetan multi-dialect speech recognition show that our model outperforms the single-dialect model and hard parameter sharing based multi-dialect model, by reducing the average syllable error rate (ASER) by 30.22% and 3.89%, respectively.
科研通智能强力驱动
Strongly Powered by AbleSci AI