混淆
观察研究
遗传关联
全基因组关联研究
多效性
生物
遗传力
人口
统计
遗传学
医学
环境卫生
数学
单核苷酸多态性
基因型
基因
表型
作者
Zijie Zhao,Xiaoyu Yang,Stephen Dorn,Jiacheng Miao,Silvia Barcellos,Jason M. Fletcher,Qiongshi Lu
标识
DOI:10.1073/pnas.2408715121
摘要
Epidemiologic associations estimated from observational data are often confounded by genetics due to pervasive pleiotropy among complex traits. Many studies either neglect genetic confounding altogether or rely on adjusting for polygenic scores (PGS) in regression analysis. In this study, we unveil that the commonly employed PGS approach is inadequate for removing genetic confounding due to measurement error and model misspecification. To tackle this challenge, we introduce PENGUIN, a principled framework for polygenic genetic confounding control based on variance component estimation. In addition, we present extensions of this approach that can estimate genetically unconfounded associations using GWAS summary statistics alone as input and between multiple generations of study samples. Through simulations, we demonstrate superior statistical properties of PENGUIN compared to the existing approaches. Applying our method to multiple population cohorts, we reveal and remove substantial genetic confounding in the associations of educational attainment with various complex traits and between parental and offspring education. Our results show that PENGUIN is an effective solution for genetic confounding control in observational data analysis with broad applications in future epidemiologic association studies.
科研通智能强力驱动
Strongly Powered by AbleSci AI