公共化学
计算机科学
代谢物
代谢组学
串联质谱法
人工智能
计算生物学
模式识别(心理学)
数据挖掘
质谱法
化学
生物
色谱法
生物化学
作者
Felicity Allen,Russell Greiner,David S. Wishart
出处
期刊:Metabolomics
[Springer Nature]
日期:2014-06-04
卷期号:11 (1): 98-110
被引量:152
标识
DOI:10.1007/s11306-014-0676-4
摘要
Electrospray tandem mass spectrometry (ESI-MS/MS) is commonly used in high throughput metabolomics. One of the key obstacles to the effective use of this technology is the difficulty in interpreting measured spectra to accurately and efficiently identify metabolites. Traditional methods for automated metabolite identification compare the target MS or MS/MS spectrum to the spectra in a reference database, ranking candidates based on the closeness of the match. However the limited coverage of available databases has led to an interest in computational methods for predicting reference MS/MS spectra from chemical structures. This work proposes a probabilistic generative model for the MS/MS fragmentation process, which we call competitive fragmentation modeling (CFM), and a machine learning approach for learning parameters for this model from MS/MS data. We show that CFM can be used in both a MS/MS spectrum prediction task (ie, predicting the mass spectrum from a chemical structure), and in a putative metabolite identification task (ranking possible structures for a target MS/MS spectrum). In the MS/MS spectrum prediction task, CFM shows significantly improved performance when compared to a full enumeration of all peaks corresponding to substructures of the molecule. In the metabolite identification task, CFM obtains substantially better rankings for the correct candidate than existing methods (MetFrag and FingerID) on tripeptide and metabolite data, when querying PubChem or KEGG for candidate structures of similar mass.
科研通智能强力驱动
Strongly Powered by AbleSci AI