自举(财务)
复制
置信区间
计算机科学
序列(生物学)
系统发育树
分数(化学)
统计
生物
计量经济学
数学
遗传学
基因
化学
有机化学
作者
Sudip Sharma,Sudhir Kumar
标识
DOI:10.1038/s43588-021-00129-5
摘要
Felsenstein’s bootstrap approach is widely used to assess confidence in species relationships inferred from multiple sequence alignments. It resamples sites randomly with replacement to build alignment replicates of the same size as the original alignment and infers a phylogeny from each replicate dataset. The proportion of phylogenies recovering the same grouping of species is its bootstrap confidence limit. However, standard bootstrap imposes a high computational burden in applications involving long sequence alignments. Here, we introduce the bag of little bootstraps approach to phylogenetics, bootstrapping only a few little samples, each containing a small subset of sites. We report that the median-bagging of bootstrap confidence limits from little samples produces confidence in inferred species relationships similar to standard bootstrap but in a fraction of the computational time and memory. Therefore, the little bootstraps approach can potentially enhance the rigor, efficiency and parallelization of big data phylogenomic analyses. The authors show that accurate bootstrap confidence limits on inferred evolutionary relationships of species can be estimated by bootstrapping a collection of little samples of very long sequence alignments. Little bootstraps take a fraction of computer time and memory compared to the standard bootstrap, enabling big data analytics on personal computers.
科研通智能强力驱动
Strongly Powered by AbleSci AI