发布文献求助

CommitBART: A Large Pre-trained Model for GitHub Commits

提交计算机科学水准点（测量）任务（项目管理）软件源代码人工智能机器学习软件工程软件开发编码器程序设计语言数据库操作系统大地测量学地理管理经济

作者

Shangqing Liu,Yanzhou Li,Yang Liu

出处

期刊：Cornell University - arXiv 日期：2022-01-01 被引量：1

链接

arxiv.org datacite.orgdoi.org

标识

DOI：10.48550/arxiv.2208.08100

摘要

GitHub commits, which record the code changes with natural language messages for description, play a critical role for software developers to comprehend the software evolution. To promote the development of the open-source software community, we collect a commit benchmark including over 7.99 million commits across 7 programming languages. Based on this benchmark, we present CommitBART, a large pre-trained encoder-decoder Transformer model for GitHub commits. The model is pre-trained by three categories (i.e., denoising objectives, cross-modal generation and contrastive learning) for six pre-training tasks to learn commit fragment representations. Furthermore, we unify a ``commit intelligence'' framework with one understanding task and three generation tasks for commits. The comprehensive experiments on these tasks demonstrate that CommitBARTsignificantly outperforms previous pre-trained works for code. Further analysis also reveals each pre-training task enhances the model performance.

求助该文献

科研通智能强力驱动
Strongly Powered by AbleSci AI

我的文献求助列表浏览历史

一分钟了解求助规则 | 捐赠本站 | 历史今天

更新

2025年影响因子查询已上线 (2025-6-18)

更新

PDF的下载单位、IP信息已删除 (2025-6-4)

科研通是完全免费的文献互助平台，具备全网最快的应助速度，最高的求助完成率。对每一个文献求助，科研通都将尽心尽力，给求助人一个满意的交代。

实时播报: 星辰大海的应助被ee采纳，获得10

1秒前; 外星汽水关闭了外星汽水的文献求助

1秒前; Lenacici发布了新的文献求助10

1秒前; q792309106发布了新的文献求助10

1秒前; Viv完成签到，获得积分10

1秒前; samuel发布了新的文献求助10

1秒前; Liufgui上传了应助文件

2秒前; 朴实的哈密瓜数据线发布了新的文献求助10

3秒前; Wei关闭了Wei的文献求助

3秒前; 李健的小迷弟上传了应助文件

3秒前; psc完成签到，获得积分10

4秒前; 善学以致用上传了应助文件

4秒前; 在水一方上传了应助文件

5秒前; 共享精神的应助被zyy采纳，获得10

5秒前; czh上传了应助文件

6秒前; 樱丸小桃子完成签到，获得积分10

6秒前; 千千完成签到，获得积分10

7秒前; 充电宝上传了应助文件

7秒前; 在水一方上传了应助文件

7秒前; 烟花的应助被冰菱采纳，获得10

7秒前; 落寞振家完成签到，获得积分20

8秒前; xinying发布了新的文献求助10

9秒前; 华仔上传了应助文件

9秒前; 西北发布了新的文献求助10

9秒前; 奋斗蜗牛发布了新的文献求助10

10秒前; ztt发布了新的文献求助10

10秒前; znnnnnnnnnn完成签到，获得积分10

11秒前; 风清扬的应助被鱼儿采纳，获得10

13秒前; 小牛马发布了新的文献求助10

13秒前; znnnnnnnnnn发布了新的文献求助10

13秒前; 研友_ngKyqn发布了新的文献求助10

14秒前; xiaohu6311完成签到，获得积分10

14秒前; Hello的应助被汤飞柏采纳，获得10

14秒前; uy完成签到，获得积分10

15秒前; 毕业比耶发布了新的文献求助20

15秒前; CC完成签到，获得积分10

17秒前; 万能图书馆上传了应助文件

17秒前; 852的应助被多吃肉采纳，获得10

18秒前; 量子星尘发布了新的文献求助10

18秒前; 小蘑菇上传了应助文件

18秒前

高分求助中: Picture Books with Same-sex Parented Families: Unintentional Censorship 1000; A new approach to the extrapolation of accelerated life test data 1000; ACSM’s Guidelines for Exercise Testing and Prescription, 12th edition 500; Nucleophilic substitution in azasydnone-modified dinitroanisoles 500; Indomethacinのヒトにおける経皮吸収 400; Phylogenetic study of the order Polydesmida (Myriapoda: Diplopoda) 370; 基于可调谐半导体激光吸收光谱技术泄漏气体检测系统的研究 310

热门求助领域（近24小时）

热门帖子: 关注科研通微信公众号，转发送积分 3979479; 求助须知：如何正确求助？哪些是违规求助？ 3523421; 关于积分的说明 11217607; 捐赠科研通 3260944; 什么是DOI，文献DOI怎么找？ 1800264; 邀请新用户注册赠送积分活动 879017; 科研通“疑难数据库（出版商）”最低求助积分说明 807126

今日热心研友

昏睡的蟠桃

热心市民小红花

注：热心度 = 本日应助数 + 本日被采纳获取积分÷10

Copyright © 2020-2025 AbleSci.COM, 科研通, All Right Reserved

科研通是非营利科研互助平台，不忘初心，为科研助力

本站互助的所有文件仅供个人学习研究用，禁止任何人把求助的所得文献进行盈利或传播

皖ICP备2024041134号-1

皖公网安备34019202002308

科研通【文献互助QQ群】：如果您有特殊求助，或发布求助超过24小时未得到应助，可加群求助，群号：941272744【点击一键加群】

科研通【志愿服务QQ群】：如果您热爱文献互助，有热心愿意为更多人服务，请加入小伙伴群，点击申请加入

关注微信服务号

科研通