作者
Stephen A. Goff,Darrell O. Ricke,Tian Lan,Gernot G. Presting,Ronglin Wang,Mary Dunn,Jane Glazebrook,Allen Sessions,Paul W. Oeller,Hemant Varma,David Hadley,Don Hutchison,Chris Martin,Fumiaki Katagiri,B. Markus Lange,Todd Moughamer,Yu Xia,Paul Budworth,Jingping Zhong,Trini Miguel,Uta Paszkowski,Shiping Zhang,Michelle Colbert,Weilin Sun,Runsheng Chen,Bret Cooper,Sylvia Park,Todd Charles Wood,Long Mao,Peter H. Quail,Rod A. Wing,Ralph A. Dean,Yeisoo Yu,Andrey Zharkikh,Richard Shen,Sudhir Sahasrabudhe,Alun Thomas,Rob Cannings,Alexander Gutin,Dmitry Pruss,Julia Reid,Sean V. Tavtigian,Jeff T. Mitchell,Glenn Eldredge,Terri Scholl,Rose Mary Miller,S. K. Bhatnagar,Nils B. Adey,Todd Rubano,Nadeem Tusneem,Rosann Robinson,Jane Feldhaus,Teresita Macalma,Arnold Oliphant,Steven P. Briggs
摘要
We have produced a draft sequence of the rice genome for the most widely cultivated subspecies in China, Oryza sativa L. ssp. indica, by whole-genome shotgun sequencing. The genome was 466 megabases in size, with an estimated 46,022 to 55,615 genes. Functional coverage in the assembled sequences was 92.0%. About 42.2% of the genome was in exact 20-nucleotide oligomer repeats, and most of the transposons were in the intergenic regions between genes. Although 80.6% of predicted Arabidopsis thaliana genes had a homolog in rice, only 49.4% of predicted rice genes had a homolog in A. thaliana. The large proportion of rice genes with no recognizable homologs is due to a gradient in the GC content of rice coding sequences.