表达数量性状基因座
全基因组关联研究
生物
计算生物学
数量性状位点
遗传关联
遗传学
特质
虚假关系
基因
计算机科学
单核苷酸多态性
机器学习
基因型
程序设计语言
作者
Michael Wainberg,Nasa Sinnott-Armstrong,Nicholas Mancuso,Alvaro Barbeira,David A. Knowles,David E. Golan,Raili Ermel,Arno Ruusalepp,Thomas Quertermous,Ke Hao,Johan Björkegren,Hae Kyung Im,Bogdan Paşaniuc,Manuel A. Rivas,Anshul Kundaje
出处
期刊:Nature Genetics
[Springer Nature]
日期:2019-03-29
卷期号:51 (4): 592-599
被引量:750
标识
DOI:10.1038/s41588-019-0385-z
摘要
Transcriptome-wide association studies (TWAS) integrate genome-wide association studies (GWAS) and gene expression datasets to identify gene-trait associations. In this Perspective, we explore properties of TWAS as a potential approach to prioritize causal genes at GWAS loci, by using simulations and case studies of literature-curated candidate causal genes for schizophrenia, low-density-lipoprotein cholesterol and Crohn's disease. We explore risk loci where TWAS accurately prioritizes the likely causal gene as well as loci where TWAS prioritizes multiple genes, some likely to be non-causal, owing to sharing of expression quantitative trait loci (eQTL). TWAS is especially prone to spurious prioritization with expression data from non-trait-related tissues or cell types, owing to substantial cross-cell-type variation in expression levels and eQTL strengths. Nonetheless, TWAS prioritizes candidate causal genes more accurately than simple baselines. We suggest best practices for causal-gene prioritization with TWAS and discuss future opportunities for improvement. Our results showcase the strengths and limitations of using eQTL datasets to determine causal genes at GWAS loci.
科研通智能强力驱动
Strongly Powered by AbleSci AI