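Proteogenomics
Artificial intelligence
Computer science
Proteomics
Deep learning
Convolutional neural network
Machine learning
Artificial neural network
Fragmentation (computing)
Identification (biology)
Bioinformatics
Workflow
Computational biology
Chemistry
Biology
Genomics
Biochemistry
Operating system
Gene
Genome
Database
Plant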
Identifier
DOI:10.1038/s41587-022-01424-w
Abstract
The recent development of machine learning methods to identify peptides in complex mass spectrometric data constitutes a major breakthrough in proteomics. Longstanding methods for peptide identification, such as search engines and experimental spectral libraries, are being superseded by deep learning models that allow the fragmentation spectra of peptides to be predicted from their amino acid sequence. These new approaches, including recurrent neural networks and convolutional neural networks, use predicted in silico spectral libraries rather than experimental libraries to achieve higher sensitivity and/or specificity in the analysis of proteomics data. Machine learning is galvanizing applications that involve large search spaces, such as immunopeptidomics and proteogenomics. Current challenges in the field include the prediction of spectra for peptides with post-translational modifications and for cross-linked pairs of peptides. Permeation of machine-learning-based spectral prediction into search engines and spectrum-centric data-independent acquisition workflows for diverse peptide classes and measurement conditions will continue to push sensitivity and dynamic range in proteomics applications in the coming years.