发布文献求助

Accounting for Digestion Enzyme Bias in Casanovo

胰蛋白酶酶计算生物学元数据消化（炼金术）质谱法产量（工程）计算机科学化学色谱法生物化学数据挖掘生物材料科学冶金操作系统

作者

Carlo Melendez,Justin Sanders,Melih Yilmaz,Wout Bittremieux,William E. Fondrie,Sewoong Oh,William Stafford Noble

出处

期刊：Journal of Proteome Research [American Chemical Society]
日期：2024-08-30

链接

标识

DOI：10.1021/acs.jproteome.4c00422

摘要

A key parameter of any bottom-up proteomics mass spectrometry experiment is the identity of the enzyme that is used to digest proteins in the sample into peptides. The Casanovo de novo sequencing model was trained using data that was generated with trypsin digestion; consequently, the model prefers to predict peptides that end with the amino acids "K" or "R". This bias is desirable when Casanovo is used to analyze data that was also generated using trypsin but can be problematic if the data was generated using some other digestion enzyme. In this work, we modify Casanovo to take as input the identity of the digestion enzyme alongside each observed spectrum. We then train Casanovo with data generated by using several different enzymes, and we demonstrate that the resulting model successfully learns to capture enzyme-specific behavior. However, we find, surprisingly, that this new model does not yield a significant improvement in sequencing accuracy relative to a model trained without enzyme information but using the same training set. This observation may have important implications for future attempts to make use of experimental metadata in de novo sequencing models.

求助该文献

最长约 10秒，即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI

我的文献求助列表浏览历史

一分钟了解求助规则 | 捐赠本站 | 历史今天

更新

2024年影响因子查询已上线 (2024-6-20)

更新

大幅提高文件上传限制，最高150M (2024-4-1)

科研通是完全免费的文献互助平台，具备全网最快的应助速度，最高的求助完成率。对每一个文献求助，科研通都将尽心尽力，给求助人一个满意的交代。

实时播报: 汉堡包上传了应助文件

刚刚; kk完成签到，获得积分10

1秒前; 东东q东东完成签到，获得积分10

1秒前; 田様的应助被彭彭采纳，获得10

2秒前; tanghong完成签到，获得积分10

2秒前; 阿玖完成签到，获得积分10

2秒前; xia发布了新的文献求助10

3秒前; 舒适逊完成签到，获得积分20

4秒前; xh-notes发布了新的文献求助20

4秒前; ww发布了新的文献求助10

4秒前; coco完成签到，获得积分10

5秒前; 神勇映安发布了新的文献求助30

5秒前; SciGPT的应助被阳光下的味道采纳，获得10

5秒前; stszd完成签到，获得积分10

5秒前; echo完成签到，获得积分10

6秒前; 3123939715完成签到，获得积分10

8秒前; yan123完成签到，获得积分10

8秒前; KAER完成签到，获得积分20

8秒前; 火星上的藏鸟完成签到，获得积分10

9秒前; 左丘不评完成签到，获得积分0

9秒前; 彭彭完成签到，获得积分20

10秒前; KeYang完成签到，获得积分10

10秒前; 田様的应助被KAER采纳，获得10

12秒前; 晓晨完成签到，获得积分10

12秒前; zhuzhu完成签到，获得积分10

12秒前; 优雅小橘子完成签到，获得积分10

12秒前; 小灰灰完成签到，获得积分10

12秒前; fhp完成签到，获得积分10

12秒前; 金花猪饲养员完成签到，获得积分10

15秒前; xh-notes完成签到，获得积分10

15秒前; 星辰大海上传了应助文件

15秒前; Yziii的应助被hongw_liu采纳，获得10

15秒前; 匆匆赶路人完成签到，获得积分10

16秒前; 科研通AI2S上传了应助文件

17秒前; wawaaaah完成签到，获得积分10

18秒前; 晴天完成签到，获得积分10

18秒前; 天真依玉完成签到，获得积分10

19秒前; diki完成签到，获得积分10

19秒前; siyuyu完成签到，获得积分10

19秒前; BlingBling完成签到，获得积分10

19秒前

高分求助中: Sustainability in Tides Chemistry 2000; Microlepidoptera Palaearctica, Volumes 1 and 3 - 13 (12-Volume Set) [German] 1122; Дружба 友好报 (1957-1958) 1000; The Data Economy: Tools and Applications 1000; Mantiden - Faszinierende Lauerjäger – Buch gebraucht kaufen 700; PraxisRatgeber Mantiden., faszinierende Lauerjäger. – Buch gebraucht kaufe 700; A Dissection Guide & Atlas to the Rabbit 600

热门求助领域（近24小时）

热门帖子: 关注科研通微信公众号，转发送积分 3099877; 求助须知：如何正确求助？哪些是违规求助？ 2751333; 关于积分的说明 7612942; 捐赠科研通 2403282; 什么是DOI，文献DOI怎么找？ 1275217; 科研通“疑难数据库（出版商）”最低求助积分说明 616310; 版权声明 599053

今日热心研友

注：热心度 = 本日应助数 + 本日被采纳获取积分÷10

Copyright © 2020-2024 AbleSci.COM, 科研通, All Right Reserved

科研通是非营利科研互助平台，不忘初心，为科研助力

本站互助的所有文件仅供个人学习研究用，禁止任何人把求助的所得文献进行盈利或传播

皖ICP备2024041134号-1

皖公网安备34019202002308

科研通【文献互助QQ群】：如果您有特殊求助，或发布求助超过24小时未得到应助，可加群求助，群号：826996720【点击一键加群】

科研通【志愿服务QQ群】：如果您热爱文献互助，有热心愿意为更多人服务，请加入小伙伴群，点击申请加入

关注微信服务号

科研通