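Haplotype
Scalability
Biology
Source code
Software
Set (abstract data type)
Sample (material)
Mode (computer interface)
Coding (set theory)
Population
Computer science
Computational biology
Genetics
Theoretical computer science
Gene
Operating system
Genotype
Physics
Programming language
Thermodynamics
Demography
Sociology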
Authors
Lianjiang Chi,Xiaolong Zhang,Yongbiao Xue,Hua Chen
Identifier
DOI:10.1111/1755-0998.13829
Abstract
Haplotype networks depict the genealogical relationships among DNA sequences within a species and are therefore widely applied in population genetics, molecular ecology, epidemiology and evolutionary studies. However, existing programs become computationally infeasible as the sample size increases. Here, we present fastHaN, an efficient and scalable program for constructing haplotype networks from large samples. On a data set with a haplotype length of 30 kb, the Median Joining Network (MJN) algorithm as implemented in fastHaN is 3000 times faster than PopART and 70 times faster than NETWORK in single-threaded mode. Its implementation of the Templeton-Crandall-Sing (TCS) algorithm is 100 times faster than PopART and 5800 times faster than the TCS software. fastHaN also supports a scalable multi-threaded mode. The source code is freely available at https://github.com/ChenHuaLab/fastHaN/. A web-based version is also available at https://ngdc.cncb.ac.cn/haplotype/.