HMOD: An Omics Database for Herbal Medicine Plants

生物 组学 传统医学 计算生物学 生物信息学 医学
作者
Xiao Wang,Jiajin Zhang,Simei He,Yuanni Gao,Xiaoqin Ma,Yun Gao,Guanghui Zhang,Ling Kui,Wen Wang,Ying Wang,Shengchao Yang,Yang Dong
出处
期刊:Molecular Plant [Elsevier]
卷期号:11 (5): 757-759 被引量:21
标识
DOI:10.1016/j.molp.2018.03.002
摘要

More than 50% of drugs are derived from chemical compounds that have been isolated from various plants (Fabricant and Farnsworth, 2001Fabricant D.S. Farnsworth N.R. The value of plants used in traditional medicine for drug discovery.Environ. Health Perspect. 2001; 109: 69-75Crossref PubMed Scopus (1344) Google Scholar, Yarnell and Abascal, 2002Yarnell E. Abascal K. Dilemmas of traditional botanical research.HerbalGram. 2002; 55: 46Google Scholar). With the development of sequencing technology and synthetic biology, we can obtain molecular information from the transcriptomic and genomic data of plants and then utilize bacteria to synthesize desired chemical compounds (Atanasov et al., 2015Atanasov A.G. Waltenberger B. Pferschy-Wenzig E.M. Linder T. Wawrosch C. Uhrin P. Temml V. Wang L. Schwaiger S. Heiss E.H. et al.Discovery and resupply of pharmacologically active plant-derived natural products: a review.Biotechnol. Adv. 2015; 33: 1582-1614Crossref PubMed Scopus (1437) Google Scholar, Smanski et al., 2016Smanski M.J. Zhou H. Claesen J. Shen B. Fischbach M.A. Voigt C.A. Synthetic biology to access and expand nature's chemical diversity.Nat. Rev. Microbiol. 2016; 14: 135-149Crossref PubMed Scopus (313) Google Scholar). Increasing numbers of researchers have started to publish omics data generated from herbal plants. However, there has been a concern that redundant data generation might occur, with some researchers expressing a desire for an all-inclusive reliable omics database for herbal medicine plants. Establishing such a database is of great importance in flourishing the research of the biogenesis and functions of herbal medicines (Yan et al., 2015Yan L. Wang X. Liu H. Tian Y. Lian J. Yang R. Hao S. Wang X. Yang S. Li Q. et al.The genome of Dendrobium officinale illuminates the biology of the important traditional Chinese orchid herb.Mol. Plant. 2015; 8: 922-934Abstract Full Text Full Text PDF PubMed Scopus (180) Google Scholar, Zhang et al., 2015Zhang G. Tian Y. Zhang J. Shu L. Yang S. Wang W. Sheng J. Dong Y. Chen W. Hybrid de novo genome assembly of the Chinese herbal plant danshen (Salvia miltiorrhiza Bunge).Gigascience. 2015; 4: 62Crossref PubMed Scopus (62) Google Scholar, Li et al., 2016Li J. Chen C. Wang Z.Z. The complete chloroplast genome of the Dendrobium strongylanthum (Orchidaceae: Epidendroideae).Mitochondrial DNA A DNA Mapp. Seq. Anal. 2016; 27: 3048-3049PubMed Google Scholar, Xu et al., 2016Xu H. Song J. Luo H. Zhang Y. Li Q. Zhu Y. Xu J. Li Y. Song C. Wang B. et al.Analysis of the genome sequence of the medicinal plant salvia miltiorrhiza.Mol. Plant. 2016; 9: 949-952Abstract Full Text Full Text PDF PubMed Scopus (190) Google Scholar). We have built the Herbal Medicine Omics Database (HMOD, Figure 1A, http://herbalplant.ynau.edu.cn/) to provide a reliable omics resource of herbal medicine plants for all researchers. In this database, we have cataloged the public-available genomic, transcriptomic, pathways data and metabolomics information of herbal medicine plants, as well as unpublished transcriptomic and enzyme data identified from KEGG annotation. Moreover, a generic genome browser (Gbrowse) has been integrated to allow the viewing of genome sequences. A BLAST tool is also provided in our database. To provide the latest advances and more analysis tools for herbal medicine plants, HMOD will be updated when new data are available (ftp://202.203.187.112:2222/). HMOD collects 23 published genomes of medicinal herbs including Panax notoginseng and other important species (Figure 1B and Supplemental Table 1) (Chen et al., 2017Chen W. Kui L. Zhang G. Zhu S. Zhang J. Wang X. Yang M. Huang H. Liu Y. Wang Y. et al.Whole-genome sequencing and analysis of the Chinese herbal plant panax notoginseng.Mol. Plant. 2017; 10: 899-902Abstract Full Text Full Text PDF PubMed Scopus (56) Google Scholar, Zhang et al., 2017Zhang D. Li W. Xia E.H. Zhang Q.J. Liu Y. Zhang Y. Tong Y. Zhao Y. Niu Y.C. Xu J.H. et al.The medicinal herb panax notoginseng genome provides insights into ginsenoside biosynthesis and genome evolution.Mol. Plant. 2017; 10: 903-907Abstract Full Text Full Text PDF PubMed Scopus (70) Google Scholar). The data for every species consist of an introduction, resequencing information, downloadable information, the Gbrowse internet browser, and BLAST. In the introduction of each herbal plant, we describe its basic areal distribution and pharmacological function. As there are still no published genome resequencing data for any medicinal herbs, single nucleotide polymorphism (SNP) information and analysis can only be added when available. For the downloadable data, we have summarized the published year, institution, sample information, sequencing platform, data size, assembly results, and annotation methods used in the projects (Figure 1C). Genomic data are contained in a fasta formatted genome file, with a cds file available in fasta format, and a protein data file available in both fasta and gff3 formats. All these files can be downloaded using ftp. The Gbrowse browser and BLAST tool are linked for further genetic and enzyme-based analysis. HMOD contains 172 transcriptomes in 57 plant families (124 published data and 48 de novo data sequenced, assembled, and annotated in this project; Figure 1D; Supplemental Tables 2 and 3). Similarly, the data for transcriptome components consist of an introduction, downloadable data, and BLAST. In the introduction, as before, we describe the basic areal distribution and pharmacological function of herbal plants. In the downloadable data, the published year, institution, sample information, sequencing platform, data size, assembly results, and annotation methods used in the projects are summarized. For the published transcriptomic data, the SRA data uploaded on NCBI have been linked in HMOD, and for de novo assembled data in this project, fasta formatted files for unigenes, cds, and protein sequences can be downloaded from this database. The de novo assembled transcriptomes have been linked to the BLAST tool. Eighteen main plant KEGG pathways information sources and other herbal plant-related websites are linked to HMOD (Figure 1E). We start the KEGG annotation with all the KEGG Orthology (KP) identifiers being retrieved and selected for these genomes and transcriptomes. We finish it with the gene name, KO, gene ID in omics data, math score, and gene description in tables, which can be downloaded. HMOD contains a summary of the metabolomic data for 55 metabolites (Figure 1F and Supplemental Table 4). These data have been summarized into 35 plant families, and the published year, institution, sample information, and results are also included in tables, which can help researchers learn about the advanced metabolomics research. Diverse bioinformatics tools are available from within HMOD. We used the Generic Genome Browser (GBrowse), developed as part of the Generic Model Organism Database project (GMOD; http://gmod.org/wiki/GMOD), to visualize genome sequences, repeat sequences, and predicted genes. A variety of tracking features can be accessed, including protein-coding genes, non-coding genes, GC content, and repetitive sequences (Figure 1G). BLAST is a useful tool that offers users the ability to search against scaffolds and genes in the herbal plant genomes and transcriptomes. On the results page for a BLAST search, each hit can be downloaded to view the sequence (Figure 1H). The search function provides a tool for finding omics information using the Latin names of target plants as keywords. In summary, HMOD provides a comprehensive set of omics data and KEGG pathway information for herbal medicine plants. HMOD will be updated regularly with new datasets being added and further improved with enhanced functionality in the future to provide a more valuable resource for facilitating comparative genomics, transcriptomes, and synthetic biology studies. The project was supported by research funds from National Natural Science Foundation of China (no. U1402262), major Science and Technique Programs in Yunnan Province (no. 2016ZF001) and the Project of Young and Middle-aged Talent of Yunnan Province (Grant No. 2014HB011).

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
刚刚
moo发布了新的文献求助10
1秒前
天天快乐应助啾啾尼泊尔采纳,获得10
1秒前
kiko发布了新的文献求助10
2秒前
2秒前
2秒前
chen完成签到,获得积分10
2秒前
汉堡包应助老猫采纳,获得10
2秒前
Fengyun完成签到,获得积分10
3秒前
3秒前
Lemrain完成签到,获得积分10
3秒前
3秒前
思源应助故意的以旋采纳,获得10
3秒前
天天快乐应助求助人员采纳,获得30
4秒前
打打应助森水垚采纳,获得10
4秒前
4秒前
白rain发布了新的文献求助10
4秒前
周不是舟完成签到,获得积分10
4秒前
量子星尘发布了新的文献求助10
5秒前
晴雨发布了新的文献求助10
5秒前
5秒前
ghx应助阔达小蜜蜂采纳,获得10
5秒前
拼搏书完成签到,获得积分10
6秒前
赫连烙发布了新的文献求助10
7秒前
花尽发布了新的文献求助10
7秒前
科研通AI2S应助kaiki采纳,获得10
7秒前
天天玩完成签到,获得积分10
9秒前
pluto应助vivi采纳,获得10
9秒前
9秒前
打工肥仔应助zh_li采纳,获得10
9秒前
9秒前
wanci应助Loki采纳,获得10
10秒前
刘梦迪发布了新的文献求助10
10秒前
10秒前
迷途羊羔关注了科研通微信公众号
11秒前
NexusExplorer应助开放夏旋采纳,获得30
11秒前
12秒前
13秒前
14秒前
LG发布了新的文献求助30
14秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Handbook of pharmaceutical excipients, Ninth edition 5000
Aerospace Standards Index - 2026 ASIN2026 3000
Signals, Systems, and Signal Processing 610
Discrete-Time Signals and Systems 610
Social Work and Social Welfare: An Invitation(7th Edition) 410
Medical Management of Pregnancy Complicated by Diabetes 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 纳米技术 有机化学 物理 生物化学 化学工程 计算机科学 复合材料 内科学 催化作用 光电子学 物理化学 电极 冶金 遗传学 细胞生物学
热门帖子
关注 科研通微信公众号,转发送积分 6056216
求助须知:如何正确求助?哪些是违规求助? 7887807
关于积分的说明 16289972
捐赠科研通 5201605
什么是DOI,文献DOI怎么找? 2783156
邀请新用户注册赠送积分活动 1765984
关于科研通互助平台的介绍 1646793