系统基因组学
树(集合论)
推论
生物
超级矩阵
最大似然
系统发育学
进化生物学
计算机科学
人工智能
数学
统计
组合数学
基因
遗传学
克莱德
当前代数
仿射李代数
纯数学
域代数上的
作者
Chao Liu,Xiaofan Zhou,Yuanning Li,Chris Todd Hittinger,Ronghui Pan,Jinyan Huang,Xue‐Xin Chen,Antonis Rokas,Yun Chen,Xing‐Xing Shen
标识
DOI:10.1093/sysbio/syae031
摘要
Abstract Maximum likelihood (ML) phylogenetic inference is widely used in phylogenomics. As heuristic searches most likely find suboptimal trees, it is recommended to conduct multiple (e.g., 10) tree searches in phylogenetic analyses. However, beyond its positive role, how and to what extent multiple tree searches aid ML phylogenetic inference remains poorly explored. Here, we found that a random starting tree was not as effective as the BioNJ and parsimony starting trees in inferring the ML gene tree and that RAxML-NG and PhyML were less sensitive to different starting trees than IQ-TREE. We then examined the effect of the number of tree searches on ML tree inference with IQ-TREE and RAxML-NG, by running 100 tree searches on 19,414 gene alignments from 15 animal, plant, and fungal phylogenomic datasets. We found that the number of tree searches substantially impacted the recovery of the best-of-100 ML gene tree topology among 100 searches for a given ML program. In addition, all of the concatenation-based trees were topologically identical if the number of tree searches was ≥10. Quartet-based ASTRAL trees inferred from 1 to 80 tree searches differed topologically from those inferred from 100 tree searches for 6/15 phylogenomic datasets. Finally, our simulations showed that gene alignments with lower difficulty scores had a higher chance of finding the best-of-100 gene tree topology and were more likely to yield the correct trees.
科研通智能强力驱动
Strongly Powered by AbleSci AI