Computer Science
Natural Language Processing
Machine Translation
Artificial Intelligence
Authors
Longyue Wang,Chenyang Lyu,Tianbo Ji,Zhirui Zhang,Dian Yu,Shuming Shi,Zhaopeng Tu
Source
Journal: Cornell University - arXiv
Date: 2023-01-01
Citations: 7
Identifier
DOI: 10.48550/arxiv.2304.02210
Abstract
Large language models (LLMs) such as ChatGPT can produce coherent, cohesive, relevant, and fluent answers for various natural language processing (NLP) tasks. Taking document-level machine translation (MT) as a testbed, this paper provides an in-depth evaluation of LLMs' ability on discourse modeling. The study focuses on three aspects: 1) Effects of Context-Aware Prompts, where we investigate the impact of different prompts on document-level translation quality and discourse phenomena; 2) Comparison of Translation Models, where we compare the translation performance of ChatGPT with commercial MT systems and advanced document-level MT methods; 3) Analysis of Discourse Modeling Abilities, where we further probe discourse knowledge encoded in LLMs and shed light on the impact of training techniques on discourse modeling. By evaluating on a number of benchmarks, we surprisingly find that LLMs have demonstrated superior performance and show potential to become a new paradigm for document-level translation: 1) leveraging their powerful long-text modeling capabilities, GPT-3.5 and GPT-4 outperform commercial MT systems in terms of human evaluation; 2) GPT-4 demonstrates a stronger ability for probing linguistic knowledge than GPT-3.5. This work highlights the challenges and opportunities of LLMs for MT, which we hope can inspire the future design and evaluation of LLMs. We release our data and annotations at https://github.com/longyuewangdcu/Document-MT-LLM.
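The first study aspect above, context-aware prompting, amounts to presenting the model with the whole document (or a window of preceding sentences) in a single prompt rather than translating sentences in isolation. A minimal sketch of such a prompt builder follows; the function name, prompt wording, and sentence-numbering scheme are illustrative assumptions, not the paper's exact prompts (those are in the linked repository).

```python
def build_context_aware_prompt(sentences, src_lang="Chinese", tgt_lang="English"):
    """Assemble one document-level prompt so the model sees cross-sentence
    context, which is what enables discourse phenomena such as coreference,
    ellipsis, and lexical cohesion to be handled consistently.

    NOTE: the instruction wording here is a hypothetical example, not the
    exact prompt used in the paper.
    """
    # Number the sentences so the model can return aligned translations.
    doc = "\n".join(f"[{i + 1}] {s}" for i, s in enumerate(sentences))
    return (
        f"Translate the following {src_lang} document into {tgt_lang}, "
        f"keeping pronouns, tense, and terminology consistent across "
        f"sentences. Preserve the [n] sentence markers in the output.\n\n"
        f"{doc}"
    )


# Example: two sentences whose second pronoun depends on the first.
prompt = build_context_aware_prompt(["他打开了门。", "然后他走了进去。"])
print(prompt)
```

The contrast with a sentence-level baseline is that each sentence would be sent in its own prompt, so the model could not resolve "他" consistently across the two lines; the document-level prompt makes that context available in one shot.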