Imputation accuracy across global human populations

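Keywords: Imputation (statistics), 1000 Genomes Project, Genome-wide association study, Minor allele frequency, Biology, Statistics, Biobank, Allele frequency, Genetics, Single-nucleotide polymorphism, Genotype, Demography, Missing data, Mathematics, Gene, Sociology
Authors
Jordan L. Cahoon, Xinyue Rui, E. Tang, Christopher M. Simons, Jalen Langie, Minhui Chen, Ying‐Chu Lo, Charleston W. K. Chiang
Source
Journal: American Journal of Human Genetics [Elsevier BV]
Cited by: 3
Identifier
DOI: 10.1016/j.ajhg.2024.03.011
Abstract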

Genotype imputation is now fundamental for genome-wide association studies but lacks fairness due to the underrepresentation of references from non-European ancestries. The state-of-the-art imputation reference panel released by the Trans-Omics for Precision Medicine (TOPMed) initiative improved the imputation of admixed African-ancestry and Hispanic/Latino samples, but imputation for populations primarily residing outside of North America may still fall short in performance due to persisting underrepresentation. To illustrate this point, we imputed the genotypes of over 43,000 individuals across 123 populations around the world and identified numerous populations where imputation accuracy paled in comparison to that of European-ancestry populations. For instance, the mean imputation r-squared (Rsq) for variants with minor allele frequencies between 1% and 5% in Saudi Arabians (n = 1,061), Vietnamese (n = 1,264), Thai (n = 2,435), and Papua New Guineans (n = 776) were 0.79, 0.78, 0.76, and 0.62, respectively, compared to 0.90-0.93 for comparable European populations matched in sample size and SNP array content. Outside of Africa and Latin America, Rsq appeared to decrease as genetic distances to European-ancestry reference increased, as predicted. Using sequencing data as ground truth, we also showed that Rsq may over-estimate imputation accuracy for non-European populations more than European populations, suggesting further disparity in accuracy between populations. Using 1,496 sequenced individuals from Taiwan Biobank as a second reference panel to TOPMed, we also assessed a strategy to improve imputation for non-European populations with meta-imputation, but this design did not improve accuracy across frequency spectra. Taken together, our analyses suggest that we must ultimately strive to increase diversity and size to promote equity within genetics research.
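
The abstract contrasts the Rsq reported by imputation with accuracy measured against sequencing data used as ground truth. As a rough illustration of that second quantity, the sketch below computes the squared Pearson correlation between true genotypes (0/1/2 allele counts) and imputed dosages, then averages it over variants whose minor allele frequency falls in the 1% to 5% bin. This is not the authors' pipeline: the function names, the input layout, and the choice to bin on the MAF of the true genotypes are all assumptions made for illustration.

```python
from statistics import mean


def dosage_r2(true_genotypes, imputed_dosages):
    """Squared Pearson correlation between true genotypes (0/1/2)
    and imputed dosages (values in [0, 2]) at a single variant."""
    if len(true_genotypes) != len(imputed_dosages):
        raise ValueError("inputs must have the same length")
    mx = mean(true_genotypes)
    my = mean(imputed_dosages)
    sxx = sum((x - mx) ** 2 for x in true_genotypes)
    syy = sum((y - my) ** 2 for y in imputed_dosages)
    if sxx == 0 or syy == 0:
        # No variation in one of the inputs: correlation is undefined.
        return float("nan")
    sxy = sum((x - mx) * (y - my)
              for x, y in zip(true_genotypes, imputed_dosages))
    return (sxy * sxy) / (sxx * syy)


def minor_allele_frequency(genotypes):
    """MAF from diploid allele counts (0/1/2 per individual)."""
    p = sum(genotypes) / (2 * len(genotypes))
    return min(p, 1 - p)


def mean_r2_in_maf_bin(variants, lo=0.01, hi=0.05):
    """Average per-variant r2 over variants with lo <= MAF < hi.

    `variants` is a hypothetical input layout: an iterable of
    (true_genotypes, imputed_dosages) pairs, one pair per variant.
    """
    values = []
    for truth, dosage in variants:
        r2 = dosage_r2(truth, dosage)
        in_bin = lo <= minor_allele_frequency(truth) < hi
        if in_bin and r2 == r2:  # r2 == r2 is False only for NaN
            values.append(r2)
    return mean(values) if values else float("nan")
```

Comparing this empirical value with the software-reported Rsq for the same variants is one way to see whether the reported metric over-estimates accuracy in a given population.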