缺少数据
孟德尔随机化
插补(统计学)
单核苷酸多态性
统计
SNP公司
计算机科学
样本量测定
数据挖掘
数学
生物
遗传学
遗传变异
基因
基因型
作者
Stephen Burgess,Shaun R. Seaman,Debbie A. Lawlor,J. P. Casas,Simon G. Thompson
摘要
Mendelian randomization studies typically have low power. Where there are several valid candidate genetic instruments, precision can be gained by using all the instruments available. However, sporadically missing genetic data can offset this gain. The authors describe 4 Bayesian methods for imputing the missing data based on a missing-at-random assumption: multiple imputations, single nucleotide polymorphism (SNP) imputation, latent variables, and haplotype imputation. These methods are demonstrated in a simulation study and then applied to estimate the causal relation between C-reactive protein and each of fibrinogen and coronary heart disease, based on 3 SNPs in British Women's Heart and Health Study participants assessed at baseline between May 1999 and June 2000. A complete-case analysis based on all 3 SNPs was found to be more precise than analyses using any 1 SNP alone. Precision is further improved by using any of the 4 proposed missing data methods; the improvement is equivalent to about a 25% increase in sample size. All methods gave similar results, which were apparently not overly sensitive to violation of the missing-at-random assumption. Programming code for the analyses presented is available online.
科研通智能强力驱动
Strongly Powered by AbleSci AI