质谱
嵌入
谱线
聚类分析
计算机科学
光谱聚类
利用
接头(建筑物)
人工神经网络
蛋白质组
比例(比率)
质谱法
人工智能
数据挖掘
化学
模式识别(心理学)
物理
生物信息学
生物
色谱法
量子力学
工程类
计算机安全
建筑工程
天文
作者
Wout Bittremieux,Damon May,Jeff Bilmes,William Stafford Noble
摘要
Abstract Computational methods that aim to exploit publicly available mass spectrometry repositories primarily rely on unsupervised clustering of spectra. Here, we propose to train a deep neural network in a supervised fashion based on previous assignments of peptides to spectra. The network, called “GLEAMS,” learns to embed spectra into a low-dimensional space in which spectra generated by the same peptide are close to one another. We use GLEAMS as the basis for a large-scale spectrum clustering, detecting groups of unidentified, proximal spectra representing the same peptide, and we show how to use these clusters to explore the dark proteome of repeatedly observed yet consistently unidentified mass spectra. We provide a software implementation of our approach, along with a tool to quickly embed additional spectra using a pre-trained model, to facilitate large-scale analyses.
科研通智能强力驱动
Strongly Powered by AbleSci AI