Statistical power of transcriptome‐wide association studies

表达数量性状基因座 单变量 全基因组关联研究 计算生物学 数量性状位点 统计能力 遗传关联 特质 生物 多元统计 遗传学 计算机科学 基因 统计 基因型 单核苷酸多态性 机器学习 数学 程序设计语言
作者
Ruoyu He,Haoran Xue,Wei Pan
出处
期刊:Genetic Epidemiology [Wiley]
卷期号:46 (8): 572-588 被引量:10
标识
DOI:10.1002/gepi.22491
摘要

Abstract Transcriptome‐Wide Association Studies (TWASs) have become increasingly popular in identifying genes (or other endophenotypes or exposures) associated with complex traits. In TWAS, one first builds a predictive model for gene expressions using an expression quantitative trait loci (eQTL) data set in stage 1, then tests the association between the predicted gene expression and a trait based on a large, independent genome‐wide association study (GWAS) data set in stage 2. However, since the sample size of the eQTL data set is usually small and the coefficient of multiple determination (i.e., ) of the model for many genes is also small, a question of interest is to what extent these factors affect the statistical power of TWAS. In addition, in contrast to a standard (univariate) TWAS (UV‐TWAS) considering only a single gene at a time, multivariate TWAS (MV‐TWAS) methods have recently emerged to account for the effects of multiple genes, or a gene's nonlinear effects, simultaneously. With the absence of the power analysis for these MV‐TWAS methods, it would be of interest to investigate whether one can gain or lose power by using the newly proposed MV‐TWAS instead of UV‐TWAS. In this paper, we first outline a general method for sample size/power calculations for two‐sample TWAS, then use real data—the Alzheimer's Disease Neuroimaging Initiative (ADNI) expression quantitative trait loci (eQTL) data and the Genotype‐Tissue Expression (GTEx) eQTL data for stage 1, the International Genomics of Alzheimer's Project Alzheimer's disease (AD) GWAS summary data and UK Biobank (UKB) individual‐level data for stage 2—to empirically address these questions. Our most important conclusions are the following. First, a sample size of a few thousands (~8000) would suffice in stage 1, where the power of TWAS would be more determined by cis ‐heritability of gene expression. Second, as in the general case of simple regression versus multiple regression, the power of MV‐TWAS may be higher or lower than that of UV‐TWAS, depending on the specific relationships among the GWAS trait and multiple genes (or linear and nonlinear terms of the same gene's expression levels), such as their correlations and effect sizes. Interestingly, several top genes with large power gains in MV‐TWAS (over that in UV‐TWAS) were known to be (and in our data more significantly) associated with AD. We also reached similar conclusions in an application to the GTEx whole blood gene expression data and UKB GWAS data of high‐density lipoprotein cholesterol. The proposed method and the conclusions are expected to be useful in planning and designing future TWAS and other related studies (e.g., Proteome‐ or Metabolome‐Wide Association Studies) when determining the sample sizes for the two stages.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
PDF的下载单位、IP信息已删除 (2025-6-4)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
江中完成签到 ,获得积分10
1秒前
1秒前
阿玖完成签到 ,获得积分10
2秒前
jiaolulu发布了新的文献求助10
4秒前
踏雪飞鸿完成签到,获得积分10
5秒前
hannah完成签到,获得积分10
5秒前
songvv发布了新的文献求助10
6秒前
一一一应助Bin_Liu采纳,获得10
7秒前
麻果完成签到,获得积分10
9秒前
OER完成签到,获得积分10
9秒前
伦语完成签到,获得积分20
9秒前
中陆完成签到,获得积分10
10秒前
11秒前
莫西莫西完成签到,获得积分10
13秒前
15秒前
量子星尘发布了新的文献求助10
16秒前
xjh完成签到,获得积分10
16秒前
16秒前
lbnzd8g完成签到,获得积分10
18秒前
中海完成签到,获得积分10
18秒前
Ww完成签到,获得积分10
18秒前
伶俐不二完成签到,获得积分10
18秒前
XIAOJU_U完成签到 ,获得积分10
19秒前
马士全发布了新的文献求助10
20秒前
MQ完成签到,获得积分10
20秒前
单纯血茗发布了新的文献求助10
22秒前
善学以致用应助田南松采纳,获得10
22秒前
不如看海完成签到 ,获得积分10
23秒前
可靠的南露完成签到,获得积分10
24秒前
gg完成签到,获得积分10
24秒前
AU完成签到,获得积分10
26秒前
与淇完成签到,获得积分10
26秒前
开心祯祯完成签到,获得积分10
26秒前
马士全完成签到,获得积分10
27秒前
Qian完成签到 ,获得积分10
27秒前
degre完成签到 ,获得积分10
27秒前
W~舞完成签到,获得积分10
28秒前
我的文献呢应助pz采纳,获得30
29秒前
潇洒的如松完成签到,获得积分10
30秒前
逆流的鱼完成签到 ,获得积分10
31秒前
高分求助中
【提示信息,请勿应助】关于scihub 10000
Les Mantodea de Guyane: Insecta, Polyneoptera [The Mantids of French Guiana] 3000
徐淮辽南地区新元古代叠层石及生物地层 3000
The Mother of All Tableaux: Order, Equivalence, and Geometry in the Large-scale Structure of Optimality Theory 3000
Handbook of Industrial Diamonds.Vol2 1100
Global Eyelash Assessment scale (GEA) 1000
Picture Books with Same-sex Parented Families: Unintentional Censorship 550
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 有机化学 生物化学 物理 内科学 纳米技术 计算机科学 化学工程 复合材料 遗传学 基因 物理化学 催化作用 冶金 细胞生物学 免疫学
热门帖子
关注 科研通微信公众号,转发送积分 4038201
求助须知:如何正确求助?哪些是违规求助? 3575940
关于积分的说明 11373987
捐赠科研通 3305747
什么是DOI,文献DOI怎么找? 1819274
邀请新用户注册赠送积分活动 892662
科研通“疑难数据库(出版商)”最低求助积分说明 815022