粒度
计算机科学
机器翻译
资源(消歧)
翻译(生物学)
人工智能
知识共享
知识管理
化学
计算机网络
程序设计语言
生物化学
信使核糖核酸
基因
作者
Chenggang Mi,Shaoliang Xie,Yi Fan
出处
期刊:ACM Transactions on Asian and Low-Resource Language Information Processing
日期:2024-01-09
卷期号:23 (2): 1-19
摘要
As the rapid development of deep learning methods, neural machine translation (NMT) has attracted more and more attention in recent years. However, lack of bilingual resources decreases the performance of the low-resource NMT model seriously. To overcome this problem, several studies put their efforts on knowledge transfer from high-resource language pairs to low-resource language pairs. However, these methods usually focus on one single granularity of language and the parameter sharing among different granularities in NMT is not well studied. In this article, we propose to improve the parameter sharing in low-resource NMT by introducing multi-granularity knowledge such as word, phrase and sentence. This knowledge can be monolingual and bilingual. We build the knowledge sharing model for low-resource NMT based on a multi-task learning framework, three auxiliary tasks such as syntax parsing, cross-lingual named entity recognition, and natural language generation are selected for the low-resource NMT. Experimental results show that the proposed method consistently outperforms six strong baseline systems on several low-resource language pairs.
科研通智能强力驱动
Strongly Powered by AbleSci AI