代谢组学
计算生物学
转化(遗传学)
数量性状位点
回归
排列(音乐)
计算机科学
全基因组关联研究
特质
人口
多重比较问题
生物
数据挖掘
统计
遗传学
数学
生物信息学
单核苷酸多态性
医学
基因型
基因
程序设计语言
物理
环境卫生
声学
作者
Sanghun Lee,Rachel S. Kelly,Kevin Mendez,Dmitry Prokopenko,Georg Hahn,Sharon M. Lutz,Juan C. Celedón,Clary B. Clish,Scott T. Weiss,Christoph Lange,Jessica Lasky‐Su,Julian Hecker
出处
期刊:Science Advances
[American Association for the Advancement of Science]
日期:2025-04-11
卷期号:11 (15)
标识
DOI:10.1126/sciadv.adp4532
摘要
Metabolomic genome-wide association studies (mGWASs), or metabolomic quantitative trait locus (metQTL) analyses, are gaining growing attention. However, robust methods and analysis guidelines, vital to address the complexity of metabolomic data, remain to be established. Here, we use whole-genome sequencing and metabolomic data from two independent studies to compare different approaches. We adopted three popular data transformation methods for metabolite levels—(i) log 10 transformation, (ii) rank inverse normal transformation, and (iii) a fully adjusted two-step procedure—and compared population-based versus family-based analysis approaches. For validation, we performed permutation-based testing, Huber regression, and independent replication analysis. Simulation studies were used to illustrate the observed differences between data transformations. We demonstrate the advantages and limitations of popular analytic strategies used in mGWASs where especially low-frequency variants in combination with a skewed metabolite measurement distribution can lead to potentially false-positive metQTL findings. We recommend the rank inverse normal transformation or robust test statistics such as in family-based association tests as reliable approaches for mGWASs.
科研通智能强力驱动
Strongly Powered by AbleSci AI