生物
推论
人口
基因组
1000基因组计划
进化生物学
计算生物学
遗传学
计算机科学
人工智能
基因
单核苷酸多态性
基因型
社会学
人口学
作者
Jonathan Terhorst,Jack Kamm,Yun S. Song
出处
期刊:Nature Genetics
[Springer Nature]
日期:2016-12-26
卷期号:49 (2): 303-309
被引量:672
摘要
Yun Song and colleagues present SMC++, a statistical method for population history inference capable of analyzing unphased whole genomes and sample sizes much larger than can be analyzed by current methods. The authors apply SMC++ to sequence data from human, Drosophila and finch populations. It has recently been demonstrated that inference methods based on genealogical processes with recombination can uncover past population history in unprecedented detail. However, these methods scale poorly with sample size, limiting resolution in the recent past, and they require phased genomes, which contain switch errors that can catastrophically distort the inferred history. Here we present SMC++, a new statistical tool capable of analyzing orders of magnitude more samples than existing methods while requiring only unphased genomes (its results are independent of phasing). SMC++ can jointly infer population size histories and split times in diverged populations, and it employs a novel spline regularization scheme that greatly reduces estimation error. We apply SMC++ to analyze sequence data from over a thousand human genomes in Africa and Eurasia, hundreds of genomes from a Drosophila melanogaster population in Africa, and tens of genomes from zebra finch and long-tailed finch populations in Australia.
科研通智能强力驱动
Strongly Powered by AbleSci AI