GIPMA: Global Intensity-Guided Peak Matching and Alignment for 2D 1H–13C HSQC-Based Metabolomics

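Heteronuclear single quantum coherence spectroscopy; Metabolomics; Chemistry; Matching (statistics); Chemical shift; Two-dimensional nuclear magnetic resonance spectroscopy; Biological system; Cluster (spacecraft); Analytical Chemistry (journal); Nuclear magnetic resonance; Statistics; Computer science; Physics; Mathematics; Chromatography; Stereochemistry; Physical chemistry; Biology; Programming language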
Authors
Huan Du, Xiu Gu, Jialuo Chen, Caihong Bai, Xiaohui Duan, Kaifeng Hu
Source
Journal: Analytical Chemistry [American Chemical Society]
Volume (issue): 95 (6): 3195-3203 Citations: 4
Identifier
DOI: 10.1021/acs.analchem.2c03323
Abstract

Two-dimensional (2D) 1H–13C heteronuclear single quantum coherence (HSQC) spectroscopy has been increasingly applied in metabolomics studies because it greatly improves resolving capability compared with one-dimensional (1D) 1H NMR. However, preprocessing methods for 2D NMR-based metabolomics, such as peak matching and alignment tools, have lagged behind their counterparts for 1D 1H NMR-based metabolomics. Correct matching and alignment of 2D NMR spectral features across multiple samples are particularly important for subsequent multivariate data analysis. Considering the different intensity dynamic ranges of the various metabolites and the chemical shift variation across the spectra of multiple samples, we developed an efficient peak matching and alignment algorithm for 2D 1H–13C HSQC-based metabolomics, called global intensity-guided peak matching and alignment (GIPMA). In GIPMA, peaks identified in all spectra are pooled together and sorted by intensity. The chemical shift of a stronger peak is regarded as more accurate and reliable than that of a weaker peak. The strongest undesignated peak is chosen as the reference of a new cluster if it does not lie within the chemical shift tolerance of any existing peak cluster (PC); otherwise, it is matched to an existing PC, and the aligned chemical shift of that PC is updated as the intensity-weighted average of the chemical shifts of all peaks in the cluster. Setting an optimum chemical shift tolerance (Δδo) is critical for peak matching and alignment across multiple samples. GIPMA dynamically searches for and selects the Δδo that maximizes the number of valid peak clusters (vPC), that is, spectral features, among multiple samples.
With GIPMA, fully automatic peakwise matching and alignment do not require any spectrum as an initial reference, and the chemical shift of each PC is updated as the intensity-weighted average of the chemical shifts of all peaks in the same PC, which is expected to be statistically more accurate. Accurate chemical shifts for each representative spectral feature facilitate subsequent peak assignment and are essential for correct metabolite identification and interpretation of results. The proposed method was demonstrated successfully on the spectra of six model mixtures consisting of seven typical metabolites, yielding correct matching of all known spectral features. The performance of GIPMA was also demonstrated on 2D 1H–13C HSQC spectra of 87 real extracts of 29 samples of five Dendrobium species. Hierarchical cluster analysis (HCA) and principal component analysis (PCA) of the 87 spectra matched and aligned by GIPMA correctly classified the 29 samples into five groups. In summary, GIPMA provides a practical peak matching and alignment method to facilitate 2D NMR-based metabolomics studies.
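
The matching procedure described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. Several details are assumptions made only for illustration: the names `Peak`, `Cluster`, `match_peaks`, `count_valid` and `select_tolerance` are hypothetical; the tolerance is modelled as a rectangular window with separate 1H and 13C widths; a peak that falls within several clusters is assigned to the nearest one; and a "valid" cluster is taken to be one with at most one peak per sample that is present in at least half of the samples. The abstract does not specify any of these choices, and the peak list is invented toy data.

```python
from dataclasses import dataclass, field
from typing import List, NamedTuple, Sequence, Tuple


class Peak(NamedTuple):
    sample: str       # spectrum the peak was picked from
    h: float          # 1H chemical shift (ppm)
    c: float          # 13C chemical shift (ppm)
    intensity: float  # peak intensity, assumed positive


@dataclass
class Cluster:
    h: float
    c: float
    members: List[Peak] = field(default_factory=list)

    def add(self, peak: Peak) -> None:
        """Add a peak and recompute the intensity-weighted centre."""
        self.members.append(peak)
        total = sum(p.intensity for p in self.members)
        self.h = sum(p.h * p.intensity for p in self.members) / total
        self.c = sum(p.c * p.intensity for p in self.members) / total


def match_peaks(peaks: Sequence[Peak], tol_h: float, tol_c: float) -> List[Cluster]:
    """Pool all peaks, visit them from strongest to weakest, and cluster them."""
    clusters: List[Cluster] = []
    for peak in sorted(peaks, key=lambda p: p.intensity, reverse=True):
        near = [cl for cl in clusters
                if abs(cl.h - peak.h) <= tol_h and abs(cl.c - peak.c) <= tol_c]
        if near:
            # Assumption: among several candidates, take the nearest centre.
            target = min(near, key=lambda cl: ((cl.h - peak.h) / tol_h) ** 2
                         + ((cl.c - peak.c) / tol_c) ** 2)
        else:
            # No cluster within tolerance: this peak seeds a new cluster.
            target = Cluster(peak.h, peak.c)
            clusters.append(target)
        target.add(peak)
    return clusters


def count_valid(clusters: Sequence[Cluster], n_samples: int,
                min_fraction: float = 0.5) -> int:
    """Count clusters that pass the (assumed) validity rule."""
    valid = 0
    for cl in clusters:
        samples = [p.sample for p in cl.members]
        unique = set(samples)
        if len(unique) == len(samples) and len(unique) >= min_fraction * n_samples:
            valid += 1
    return valid


def select_tolerance(peaks: Sequence[Peak], n_samples: int,
                     candidates: Sequence[Tuple[float, float]]) -> Tuple[float, float]:
    """Scan candidate (tol_h, tol_c) pairs; keep the one with most valid clusters."""
    return max(candidates,
               key=lambda t: count_valid(match_peaks(peaks, *t), n_samples))


# Toy data: two spectral features observed in three samples.
peaks = [
    Peak("s1", 1.330, 22.90, 100.0), Peak("s2", 1.332, 22.92, 50.0),
    Peak("s3", 1.328, 22.88, 50.0), Peak("s1", 3.200, 56.50, 10.0),
    Peak("s2", 3.204, 56.54, 10.0), Peak("s3", 3.196, 56.46, 20.0),
]
clusters = match_peaks(peaks, tol_h=0.01, tol_c=0.1)
best = select_tolerance(peaks, 3, [(0.001, 0.01), (0.01, 0.1), (5.0, 50.0)])
for cl in clusters:
    print(f"{cl.h:.4f} ppm, {cl.c:.3f} ppm, {len(cl.members)} peaks")
print("selected tolerance:", best)
```

In this toy scan, the tightest window splits every peak into its own cluster and the widest window merges both features into one cluster containing repeated samples, so neither produces any valid cluster under the assumed rule. Only the intermediate window recovers both features, which is the behaviour the tolerance search is meant to exploit.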
