发布文献求助

ProphetNet: Predicting Future N-gram for Sequence-to-SequencePre-training

计算机科学过度拟合自动汇总序列（生物学）人工智能纳克背景（考古学）机器学习克比例（比率）语言模型人工神经网络物理古生物学生物量子力学细菌遗传学

作者

Weizhen Qi,Yongtao Yu,Yeyun Gong,Dayiheng Liu,Nan Duan,Jiusheng Chen,Ruofei Zhang,Ming Zhou

链接

arxiv.org arxiv.orgdoi.org

标识

DOI：10.18653/v1/2020.findings-emnlp.217

摘要

This paper presents a new sequence-to-sequence pre-training model called ProphetNet, which introduces a novel self-supervised objective named future n-gram prediction and the proposed n-stream self-attention mechanism. Instead of optimizing one-step-ahead prediction in the traditional sequence-to-sequence model, the ProphetNet is optimized by n-step ahead prediction that predicts the next n tokens simultaneously based on previous context tokens at each time step. The future n-gram prediction explicitly encourages the model to plan for the future tokens and prevent overfitting on strong local correlations. We pre-train ProphetNet using a base scale dataset (16GB) and a large-scale dataset (160GB), respectively. Then we conduct experiments on CNN/DailyMail, Gigaword, and SQuAD 1.1 benchmarks for abstractive summarization and question generation tasks. Experimental results show that ProphetNet achieves new state-of-the-art results on all these datasets compared to the models using the same scale pre-training corpus.

求助该文献

科研通智能强力驱动
Strongly Powered by AbleSci AI

我的文献求助列表浏览历史

一分钟了解求助规则 | 捐赠本站 | 历史今天

更新

2025年影响因子查询已上线 (2025-6-18)

更新

PDF的下载单位、IP信息已删除 (2025-6-4)

科研通是完全免费的文献互助平台，具备全网最快的应助速度，最高的求助完成率。对每一个文献求助，科研通都将尽心尽力，给求助人一个满意的交代。

实时播报: 张鱼小丸子发布了新的文献求助20

刚刚; 酷波er上传了应助文件

1秒前; Rondab上传了应助文件

1秒前; 活力雨灵完成签到，获得积分10

1秒前; wanci上传了应助文件

3秒前; lilongcheng发布了新的文献求助10

3秒前; 失眠的访彤发布了新的文献求助10

3秒前; CipherSage上传了应助文件

3秒前; 科研通AI2S上传了应助文件

4秒前; 雪雪儿发布了新的文献求助10

4秒前; 夏果完成签到，获得积分20

4秒前; Yang发布了新的文献求助10

4秒前; 陈冲完成签到，获得积分10

4秒前; pluto上传了应助文件

4秒前; 宫冷雁发布了新的文献求助10

5秒前; lsn完成签到，获得积分10

6秒前; 卡卡西的应助被科研通管家采纳，获得10

6秒前; Ava的应助被科研通管家采纳，获得10

6秒前; 充电宝的应助被科研通管家采纳，获得10

6秒前; 穆仰的应助被科研通管家采纳，获得10

7秒前; 李健的应助被科研通管家采纳，获得10

7秒前; 卡卡西的应助被科研通管家采纳，获得20

7秒前; 善学以致用的应助被科研通管家采纳，获得10

7秒前; FashionBoy的应助被科研通管家采纳，获得10

7秒前; 深情安青的应助被科研通管家采纳，获得10

7秒前; 人生如梦的应助被科研通管家采纳，获得10

7秒前; 兴奋的定帮的应助被科研通管家采纳，获得10

7秒前; 王子安的应助被科研通管家采纳，获得10

7秒前; 核桃的应助被龙辉采纳，获得10

7秒前; 汉堡包的应助被科研通管家采纳，获得10

7秒前; 英俊的铭的应助被科研通管家采纳，获得10

8秒前; 打打的应助被科研通管家采纳，获得10

8秒前; 李健的小迷弟的应助被科研通管家采纳，获得10

8秒前; 传奇3的应助被科研通管家采纳，获得10

8秒前; 李爱国的应助被科研通管家采纳，获得10

8秒前; 卡卡西的应助被科研通管家采纳，获得10

8秒前; May的应助被科研通管家采纳，获得20

8秒前; 英姑的应助被科研通管家采纳，获得10

8秒前; ding上传了应助文件

8秒前; 烟花的应助被科研通管家采纳，获得10

8秒前

高分求助中: The Mother of All Tableaux Order, Equivalence, and Geometry in the Large-scale Structure of Optimality Theory 2400; Ophthalmic Equipment Market by Devices(surgical: vitreorentinal,IOLs,OVDs,contact lens,RGP lens,backflush,diagnostic&monitoring:OCT,actorefractor,keratometer,tonometer,ophthalmoscpe,OVD), End User,Buying Criteria-Global Forecast to2029 2000; A new approach to the extrapolation of accelerated life test data 1000; Cognitive Neuroscience: The Biology of the Mind 1000; Cognitive Neuroscience: The Biology of the Mind (Sixth Edition) 1000; Optimal Transport: A Comprehensive Introduction to Modeling, Analysis, Simulation, Applications 800; Official Methods of Analysis of AOAC INTERNATIONAL 600

热门求助领域（近24小时）

热门帖子: 关注科研通微信公众号，转发送积分 3958850; 求助须知：如何正确求助？哪些是违规求助？ 3505102; 关于积分的说明 11122496; 捐赠科研通 3236558; 什么是DOI，文献DOI怎么找？ 1788899; 邀请新用户注册赠送积分活动 871424; 科研通“疑难数据库（出版商）”最低求助积分说明 802794

今日热心研友

热心市民小红花

眼睛大雨筠

昏睡的蟠桃

现代的访曼

注：热心度 = 本日应助数 + 本日被采纳获取积分÷10

Copyright © 2020-2025 AbleSci.COM, 科研通, All Right Reserved

科研通是非营利科研互助平台，不忘初心，为科研助力

本站互助的所有文件仅供个人学习研究用，禁止任何人把求助的所得文献进行盈利或传播

皖ICP备2024041134号-1

皖公网安备34019202002308

科研通【文献互助QQ群】：如果您有特殊求助，或发布求助超过24小时未得到应助，可加群求助，群号：941272744【点击一键加群】

科研通【志愿服务QQ群】：如果您热爱文献互助，有热心愿意为更多人服务，请加入小伙伴群，点击申请加入

关注微信服务号

科研通