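Genome
Biology
Clade
Computational biology
Genetic diversity
Structural variation
Evolutionary biology
Gene
Reference genome
Genetics
Phylogenetics
Population
Demography
Sociology
Authors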
Murukarthick Jayakodi, Hyeonah Shim, Martin Mascher
Identifier
DOI:10.1146/annurev-arplant-090823-015358
Abstract
A single reference genome does not fully capture species diversity. By contrast, a pangenome incorporates multiple genomes to capture the entire set of nonredundant genes in a given species, along with its genome diversity. New sequencing technologies enable researchers to produce multiple high-quality genome sequences and catalog diverse genetic variations with better precision. Pangenomic studies have detected structural variants in plant genomes, dissected the genetic architecture of agronomic traits, and helped unravel molecular underpinnings and evolutionary origins of plant phenotypes. The pangenome concept has further evolved into a so-called superpangenome that includes wild relatives within a genus or clade and shifted to graph-based reference systems. Nevertheless, building pangenomes and representing complex structural variants remain challenging in many crops. Standardized computing pipelines and common data structures are needed to compare and interpret pangenomes. The growing body of plant pangenomics data requires new algorithms, huge data storage capacity, and training to help researchers and breeders take advantage of newly discovered genes and genetic variants.