系统发育树
生物
基因家族
利用
选择(遗传算法)
树(集合论)
基因
计算生物学
自然选择
基因组
克莱德
连续性
进化生物学
遗传学
计算机科学
数学
机器学习
生态学
数学分析
计算机安全
作者
Pablo Librado,Pascale Roux
出处
期刊:Methods in molecular biology
日期:2022-01-01
卷期号:: 213-232
被引量:1
标识
DOI:10.1007/978-1-0716-2691-7_10
摘要
Estimating gene gain and losses is paramount to understand the molecular mechanisms underlying adaptive evolution. Despite the advent of high-throughput sequencing, such analyses have been so far hampered by the poor contiguity of genome assemblies. The increasing affordability of long-read sequencing technologies will however revolutionize our capacity to identify gene gains and losses at an unprecedented resolution, even in non-model organisms. To thoroughly exploit all such multigene family variation, the software BadiRate implements a collection of birth-and-death stochastic models, aiming at estimating by maximum likelihood the gene turnover rates along the internal and external branches of a given phylogenetic species tree. Its statistical framework also provides versatility for inferring the gene family content at the internal phylogenetic nodes (and to estimate the minimum number of gene gains and losses in each branch), for statistically contrasting competing hypotheses (e.g., accelerations of the gene turnover rates at pre-defined clades), and for pinpointing gene family expansions or contractions likely driven by natural selection. In this chapter we review the theoretical models implemented in BadiRate and illustrate their applicability by analyzing a hypothetical data set of 14 microbial species.
科研通智能强力驱动
Strongly Powered by AbleSci AI