作者
Ming Li,Congjiao Sun,Naiyi Xu,Peipei Bian,Xiaomeng Tian,Xihong Wang,Yuzhe Wang,Xinzheng Jia,Rasmus Heller,Mingshan Wang,Fei Wang,Xuelei Dai,Rongsong Luo,Yingwei Guo,Xiangnan Wang,Peng Yang,Dexiang Hu,Zhenyu Liu,Weiwei Fu,Shunjin Zhang,Xiaochang Li,Chaoliang Wen,Fangren Lan,Amam Zonaed Siddiki,Chatmongkon Suwannapoom,Xin Zhao,Qinghua Nie,Xiaoxiang Hu,Yu Jiang,Ning Yang
摘要
The gene numbers and evolutionary rates of birds were assumed to be much lower than those of mammals, which is in sharp contrast to the huge species number and morphological diversity of birds. It is, therefore, necessary to construct a complete avian genome and analyze its evolution. We constructed a chicken pan-genome from 20 de novo assembled genomes with high sequencing depth, and identified 1,335 protein-coding genes and 3,011 long noncoding RNAs not found in GRCg6a. The majority of these novel genes were detected across most individuals of the examined transcriptomes but were seldomly measured in each of the DNA sequencing data regardless of Illumina or PacBio technology. Furthermore, different from previous pan-genome models, most of these novel genes were overrepresented on chromosomal subtelomeric regions and microchromosomes, surrounded by extremely high proportions of tandem repeats, which strongly blocks DNA sequencing. These hidden genes were proved to be shared by all chicken genomes, included many housekeeping genes, and enriched in immune pathways. Comparative genomics revealed the novel genes had 3-fold elevated substitution rates than known ones, updating the knowledge about evolutionary rates in birds. Our study provides a framework for constructing a better chicken genome, which will contribute toward the understanding of avian evolution and the improvement of poultry breeding.