Authors
Kunbo Wang, Zhiwen Wang, Fuguang Li, Wuwei Ye, Junyi Wang, Guoli Song, Zhen Yue, Lin Cong, Hǎihóng Shāng, Shilin Zhu, Changsong Zou, Qin Li, Yǒulù Yuán, Changming Lu, Hengling Wei, Caiyun Gou, Zequn Zheng, Ye Yin, Xueyan Zhang, Kun Liu, Bo Wang, Chi Song, Nan Song, R. J. Kohel, Richard G. Percy, John Z. Yu, Yuxian Zhu, Jun Wang, Shuang Yu
Abstract
We have sequenced and assembled a draft genome of Gossypium raimondii, whose progenitor is the putative contributor of the D subgenome to the economically important fiber-producing cotton species Gossypium hirsutum and Gossypium barbadense. Over 73% of the assembled sequences were anchored on 13 G. raimondii chromosomes. The genome contains 40,976 protein-coding genes, 92.2% of which were further confirmed by transcriptome data. We observed evidence of the hexaploidization event shared by the eudicots as well as of a cotton-specific whole-genome duplication approximately 13 to 20 million years ago. We identified 2,355 syntenic blocks in the G. raimondii genome and found that approximately 40% of the paralogous genes were present in more than one block, which suggests that this genome has undergone substantial chromosome rearrangement during its evolution. Phylogenetic analysis indicates that cotton, and probably Theobroma cacao, are the only sequenced plant species that possess an authentic CDN1 gene family for gossypol biosynthesis.