化学
代谢组学
背景(考古学)
色谱法
质谱法
校准
传感器融合
偏最小二乘回归
代谢物
人工智能
计算机科学
统计
数学
机器学习
古生物学
生物化学
生物
作者
Andrew A. Vaughan,Warwick B. Dunn,J. William Allwood,David C. Wedge,Fiona Blackhall,Anthony D. Whetton,Caroline Dive,Royston Goodacre
摘要
Metabolic profiling is routinely performed on multiple analytical platforms to increase the coverage of detected metabolites, and it is often necessary to distribute biological and clinical samples from a study between instruments of the same type to share the workload between different laboratories. The ability to combine metabolomics data arising from different sources is therefore of great interest, particularly for large-scale or long-term studies, where samples must be analyzed in separate blocks. This is not a trivial task, however, due to differing data structures, temporal variability, and instrumental drift. In this study, we employed blood serum and plasma samples collected from 29 subjects diagnosed with small cell lung cancer and analyzed each sample on two liquid chromatography–mass spectrometry (LC-MS) platforms. We describe a method for mapping retention times and matching metabolite features between platforms and approaches for fusing data acquired from both instruments. Calibration transfer models were developed and shown to be successful at mapping the response of one LC-MS instrument to another (Procrustes dissimilarity = 0.04; Mantel correlation = 0.95), allowing us to merge the data from different samples analyzed on different instruments. Data fusion was assessed in a clinical context by comparing the correlation of each metabolite with subject survival time in both the original and fused data sets: a simple autoscaling procedure (Pearson’s R = 0.99) was found to improve upon a calibration transfer method based on partial least-squares regression (R = 0.94).
科研通智能强力驱动
Strongly Powered by AbleSci AI