化学
肽
生物信息学
胰蛋白酶
蛋白质组学
串联质谱法
质谱法
自下而上蛋白质组学
色谱法
计算生物学
生物化学
蛋白质质谱法
酶
生物
基因
作者
Jinghan Yang,Zhiyuan Cheng,Fuzhou Gong,Yan Fu
标识
DOI:10.1021/acs.analchem.2c03662
摘要
In tandem mass spectrometry-based proteomics, proteins are digested into peptides by specific protease(s), but generally only a fraction of peptides can be detected. To characterize detectable proteotypic peptides, we have developed a series of methods to predict peptide digestibility and detectability. Here, we propose a bidirectional long short-term memory (BiLSTM)-based algorithm, named DeepDetect, for the prediction of peptide detectability enhanced by peptide digestibility. Compared with existing algorithms, DeepDetect is featured by its improved prediction accuracy for a wide range of commonly used proteases, covering trypsin, ArgC, chymotrypsin, GluC, LysC, AspN, LysN, and LysargiNase. On 11 test data sets from E. coli, yeast, mouse, and human samples, DeepDetect achieved higher prediction accuracies than PepFormer, a state-of-the-art deep-learning-based peptide detectability prediction algorithm. The results further demonstrated that peptide digestibility can substantially enhance the performance of peptide detectability predictors. As an application, DeepDetect was used to reduce the in silico predicted spectral libraries in data-independent acquisition mass spectrometry data analysis. Experiments using DIA-NN software showed that DeepDetect can significantly accelerate the library search without loss of peptide and protein identification sensitivity.
科研通智能强力驱动
Strongly Powered by AbleSci AI