Inference of Population Structure using Dense Haplotype Data

生物 连锁不平衡 推论 人口 聚类分析 可解释性 主成分分析 单倍型 联动装置(软件) 遗传学 进化生物学 计算生物学 数据挖掘 人工智能 计算机科学 基因 基因型 社会学 人口学
作者
Daniel J. Lawson,Garrett Hellenthal,Simon Myers,Daniel Falush
出处
期刊:PLOS Genetics [Public Library of Science]
卷期号:8 (1): e1002453-e1002453 被引量:1137
标识
DOI:10.1371/journal.pgen.1002453
摘要

The advent of genome-wide dense variation data provides an opportunity to investigate ancestry in unprecedented detail, but presents new statistical challenges. We propose a novel inference framework that aims to efficiently capture information on population structure provided by patterns of haplotype similarity. Each individual in a sample is considered in turn as a recipient, whose chromosomes are reconstructed using chunks of DNA donated by the other individuals. Results of this "chromosome painting" can be summarized as a "coancestry matrix," which directly reveals key information about ancestral relationships among individuals. If markers are viewed as independent, we show that this matrix almost completely captures the information used by both standard Principal Components Analysis (PCA) and model-based approaches such as STRUCTURE in a unified manner. Furthermore, when markers are in linkage disequilibrium, the matrix combines information across successive markers to increase the ability to discern fine-scale population structure using PCA. In parallel, we have developed an efficient model-based approach to identify discrete populations using this matrix, which offers advantages over PCA in terms of interpretability and over existing clustering algorithms in terms of speed, number of separable populations, and sensitivity to subtle population structure. We analyse Human Genome Diversity Panel data for 938 individuals and 641,000 markers, and we identify 226 populations reflecting differences on continental, regional, local, and family scales. We present multiple lines of evidence that, while many methods capture similar information among strongly differentiated groups, more subtle population structure in human populations is consistently present at a much finer level than currently available geographic labels and is only captured by the haplotype-based approach. The software used for this article, ChromoPainter and fineSTRUCTURE, is available from http://www.paintmychromosomes.com/.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
希望天下0贩的0应助秋秋采纳,获得50
刚刚
废废废发布了新的文献求助10
刚刚
酒石酸完成签到,获得积分10
1秒前
肖淑美完成签到 ,获得积分10
2秒前
宁静完成签到,获得积分10
3秒前
li应助许琳琳采纳,获得10
3秒前
3秒前
3秒前
小星星完成签到 ,获得积分10
4秒前
科研通AI5应助Bonnienuit采纳,获得10
5秒前
学好英语完成签到,获得积分10
5秒前
爱雨霁完成签到,获得积分10
5秒前
5秒前
玩命的寄翠完成签到 ,获得积分10
5秒前
科研通AI5应助南敏株采纳,获得10
6秒前
田様应助空空采纳,获得10
6秒前
6秒前
tg2024发布了新的文献求助10
6秒前
ruoyu111发布了新的文献求助20
6秒前
睚眦倒影完成签到,获得积分10
7秒前
科研通AI5应助linzy采纳,获得30
7秒前
开水发布了新的文献求助10
8秒前
8秒前
田所浩二完成签到 ,获得积分10
9秒前
9秒前
Ava应助睚眦倒影采纳,获得10
10秒前
12秒前
宇宙凛发布了新的文献求助10
12秒前
12秒前
啦啦啦啦呼完成签到,获得积分10
12秒前
脑洞疼应助奥利奥采纳,获得10
12秒前
happy发布了新的文献求助10
12秒前
13秒前
13秒前
yyyfff应助幸幸采纳,获得10
13秒前
小二郎应助幸幸采纳,获得10
13秒前
zhenya发布了新的文献求助10
14秒前
15秒前
15秒前
科研通AI5应助奉天BB机采纳,获得10
15秒前
高分求助中
【此为提示信息,请勿应助】请按要求发布求助,避免被关 20000
Production Logging: Theoretical and Interpretive Elements 3000
CRC Handbook of Chemistry and Physics 104th edition 1000
Gay and Lesbian Asia 1000
Density Functional Theory: A Practical Introduction, 2nd Edition 840
J'AI COMBATTU POUR MAO // ANNA WANG 660
Izeltabart tapatansine - AdisInsight 600
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 有机化学 物理 生物化学 纳米技术 计算机科学 化学工程 内科学 复合材料 物理化学 电极 遗传学 量子力学 基因 冶金 催化作用
热门帖子
关注 科研通微信公众号,转发送积分 3758862
求助须知:如何正确求助?哪些是违规求助? 3301919
关于积分的说明 10120099
捐赠科研通 3016294
什么是DOI,文献DOI怎么找? 1656447
邀请新用户注册赠送积分活动 790425
科研通“疑难数据库(出版商)”最低求助积分说明 753871