作者
Wenyu Shi,Qinglan Sun,Guomei Fan,Hideaki Sugawara,Moriya Ohkuma,Takashi Itoh,Yu‐Guang Zhou,Man Cai,Song-Gun Kim,Jung‐Sook Lee,Ivo Sedláček,David R. Arahal,Teresa Lucena,Hiroko Kawasaki,Lyudmila I. Evtushenko,Bevan Weir,Sarah Alexander,Dénes Dlauchy,Somboon Tanasupawat,Lily Eurwilaichitr,Supawadee Ingsriswang,Bruno Gómez‐Gil,Manzour Hernando Hazbón,Marco A. Riojas,Chatrudee Suwannachart,Su Yao,Peter Vandamme,Fang Peng,Zenghui Chen,Dongmei Liu,Xiuqiang Sun,Xinjiao Zhang,Yuanchun Zhou,Zhen Meng,Linhuan Wu,Juncai Ma
摘要
Abstract Taxonomic and functional research of microorganisms has increasingly relied upon genome-based data and methods. As the depository of the Global Catalogue of Microorganisms (GCM) 10K prokaryotic type strain sequencing project, Global Catalogue of Type Strain (gcType) has published 1049 type strain genomes sequenced by the GCM 10K project which are preserved in global culture collections with a valid published status. Additionally, the information provided through gcType includes >12 000 publicly available type strain genome sequences from GenBank incorporated using quality control criteria and standard data annotation pipelines to form a high-quality reference database. This database integrates type strain sequences with their phenotypic information to facilitate phenotypic and genotypic analyses. Multiple formats of cross-genome searches and interactive interfaces have allowed extensive exploration of the database's resources. In this study, we describe web-based data analysis pipelines for genomic analyses and genome-based taxonomy, which could serve as a one-stop platform for the identification of prokaryotic species. The number of type strain genomes that are published will continue to increase as the GCM 10K project increases its collaboration with culture collections worldwide. Data of this project is shared with the International Nucleotide Sequence Database Collaboration. Access to gcType is free at http://gctype.wdcm.org/.