生物
航程(航空)
顺序装配
脚手架
基因组
计算生物学
遗传学
进化生物学
基因
数据库
计算机科学
工程类
航空航天工程
基因表达
转录组
作者
Kai Li,Melissa Smith,John C. Blazier,Kelli J. Kochan,Jonathan Wood,Kerstin Howe,Anne E. Kwitek,Melinda R. Dwinell,Hao Chen,Julia L. Ciosek,Patrick Masterson,Terence D. Murphy,Theodore S. Kalbfleisch,Peter A. Doris
标识
DOI:10.1101/gr.279292.124
摘要
We report the construction and analysis of a new reference genome assembly for Rattus norvegicus , the laboratory rat, a widely used experimental animal model organism. The assembly has been adopted as the rat reference assembly by the Genome Reference Consortium and is named GRCr8. The assembly has employed 40× Pacific Biosciences (PacBio) HiFi sequencing coverage and scaffolding using optical mapping and Hi-C. We used genomic DNA from a male BN/NHsdMcwi (BN) rat of the same strain and from the same colony as the prior reference assembly, mRatBN7.2. The assembly is at chromosome level with 98.7% of the sequence assigned to chromosomes. All chromosomes have increased in size compared with the prior assembly and k -mer analysis indicates that the subject animal is fully inbred and that the genome is represented as a single haploid assembly. Notable increases are observed in Chromosomes 3, 11, and 12 in the prospective rDNA regions. In addition, Chr Y has increased threefold in size and is more consistent with the rat karyotype than previous assemblies. Several other chromosomes have grown by the incorporation of sizable discrete new blocks. These contain highly repetitive sequences and encode numerous previously unannotated genes. In addition, centromeric sequences are incorporated in most chromosomes. Genome annotation has been performed by NCBI RefSeq, which confirms improvement in assembly quality and adds more than 1100 new protein coding genes. PacBio Iso-Seq data have been acquired from multiple tissues of the subject animal and are released concurrently with the new assembly to aid further analyses.
科研通智能强力驱动
Strongly Powered by AbleSci AI