机器翻译
判决
计算机科学
翻译(生物学)
领域(数学)
人工智能
自然语言处理
神经工程
数学
化学
生物化学
基因
信使核糖核酸
纯数学
作者
B Teng,Yuan Chen,Juwei Zhang
出处
期刊:ACM Transactions on Asian and Low-Resource Language Information Processing
日期:2025-01-15
摘要
Due to the limited availability of corpora in the field of Electrical Engineering and the presence of numerous specialized terms, neural machine translation (NMT) performs poorly in translating the sentence backbone information when it is applied to corpora in the field of Electrical Engineering. In response to this issue, A method to improve NMT by using the sentence backbone information is proposed in this paper. In the proposed method, the source language sentences are used as the input of the Sentence Backbone Information Extraction Model to obtain the sentence backbone information, and then the sentence backbone information are incorporated as an auxiliary during the training process of the NMT model. Furthermore, a module called the Sentence Backbone Information Enhancement Module is introduced. It utilizes the dependency parse trees of the source language sentences to generate the sentence backbone mask matrices. These matrices are then applied to the encoder to force the NMT model to pay more attention to the backbones of sentences. On the English-Chinese parallel corpus in the field of Electrical Engineering, the proposed method in this paper outperforms the Transformer baseline translation model by 1.25 BLEU points. And it outperforms the baseline model in both METEOR and ROUGE-L evaluation metrics. It indicates that the proposed method in this paper can effectively improve translation performance.
科研通智能强力驱动
Strongly Powered by AbleSci AI