连锁不平衡
修剪
生物
人口
不平衡
遗传学
主成分分析
联动装置(软件)
相关性
单倍型
等位基因
等位基因频率
进化生物学
统计
基因
数学
人口学
植物
医学
眼科
几何学
社会学
作者
Ulises Bercovich,Malthe Sebro Rasmussen,Zilong Li,Carsten Wiuf,Anders Albrechtsen
出处
期刊:Genetics
[Oxford University Press]
日期:2025-02-05
标识
DOI:10.1093/genetics/iyaf009
摘要
Abstract Standard measures of linkage disequilibrium (LD) are affected by admixture and population structure, such that loci that are not in LD within each ancestral population appear linked when considered jointly across the populations. The influence of population structure on LD can cause problems for downstream analysis methods, in particular those that rely on LD pruning or clumping. To address this issue, we propose a measure of LD that accommodates population structure using the top inferred principal components. We estimate LD from the correlation of genotype residuals and prove that this LD measure remains unaffected by population structure when analyzing multiple populations jointly, even with admixed individuals. Based on this adjusted measure of LD, we can perform LD pruning to remove the correlation between markers for downstream analysis. Traditional LD pruning is more likely to remove markers with high differences in allele frequencies between populations, which biases measures for genetic differentiation and removes markers that are not in LD in the ancestral populations. Using data from moderately differentiated human populations and highly differentiated giraffe populations we show that traditional LD pruning biases FST and principal component analysis (PCA), which can be alleviated with the adjusted LD measure. In addition, we show that the adjusted LD leads to better PCA when pruning and that LD clumping retains more sites with the retained sites having stronger associations.
科研通智能强力驱动
Strongly Powered by AbleSci AI