单倍群
基因分型
软件
生物
染色体
单核苷酸多态性
遗传学
1000基因组计划
样品(材料)
计算生物学
基因型
计算机科学
算法
单倍型
基因
色谱法
化学
程序设计语言
摘要
Abstract We have developed an algorithm to rapidly and accurately identify the Y-chromosome haplogroup of each male in a sample of one to millions. The algorithm, implemented in the yHaplo * software package (yHaplo), does not rely on any particular genotyping modality or platform. Full sequences yield the most granular haplogroup classifications, but genotyping arrays can yield reliable calls, provided a reasonable number of phylogenetically informative variants has been assayed. The algorithm is robust to missing data, genotype errors, mutation recurrence, and other complications. We have tested the software on full sequences from phase 3 of the 1000 Genomes Project and on subsets thereof constructed by downsampling to SNPs present on each of four genotyping arrays. We have also run the software on array data from more than 600,000 males.
科研通智能强力驱动
Strongly Powered by AbleSci AI