ScDB: A comprehensive database dedicated to Saccharum, facilitating functional genomics and molecular biology studies in sugarcane

糖精 多倍体 糖精 生物 基因组 基因组学 倍性 生物技术 遗传学 基因 植物
作者
Siyuan Chen,Xiaoxi Feng,Qian Zhang,Xiuting Hua,Qian Zhang,Chengjie Chen,Jiawei Li,Jing Wang,Chengyin Weng,Baoshan Chen,Muqing Zhang,Wei Yao,Haibao Tang,Ray Ming,Jisen Zhang
出处
期刊:Plant Biotechnology Journal [Wiley]
标识
DOI:10.1111/pbi.14457
摘要

Sugarcane is the world's important sugar crop, serving as the primary feedstock for the production of sugar and biofuels. Modern sugarcane cultivar resulting from deliberate interspecific hybridization between Saccharum officinarum and Saccharum spontaneum. The utilization of wild resources is essential for the development of high-quality sugarcane varieties, and the genomic and omics analyses of these materials provide valuable insights into their molecular mechanisms. However, the complexity of the sugarcane genome has historically presented challenges for researchers. In our previous studies, we led the efforts to assemble the genome of a haploid S. spontaneum AP85-441 (Zhang et al., 2018) and pioneered the approach to tackle a complex autopolyploid at allele-level resolution. We then traced the origins of Saccharum and mapped the chromosomal evolution in S. spontaneum Np-X (Zhang et al., 2022). Additionally, we successfully assembled a complete, gap-free diploid Erianthus rufipilus YN2009-3 genome, shedding light on the genomic footprints of evolution in the highly polyploid Saccharum (Wang et al., 2023). Meanwhile, we are proud to present the genome of Saccharum hybrid XTT22, considered the most significant achievement in sugarcane research. Our work is currently accepted and will soon be online (Zhang et al., Nature Genetics). In addition, other teams have similarly worked on genome research in the Sugarcane. This year, the genomes of modern sugarcane R570 and ZZ1 were published by A. D'Hont's team and Muqing Zhang's team, respectively (Bao et al., 2024; Healey et al., 2024). Building upon this foundation, we are pleased to introduce ScDB (Saccharum genomic database, https://sugarcane.gxu.edu.cn/scdb), the first user-friendly multi-omics database for six Saccharum species (AP85-441, Np-X, LA-Purple, XTT22, R570, ZZ1) and a Erianthus rufipilus (YN2009-3). ScDB currently comprises a total of 38.91 Gb of genomic assembly sequences, encompassing 1 366 608 genes. Additionally, ScDB includes 24 transcriptome projects involving over 300 sugarcane samples and approximately 2.5 TB of data. Furthermore, 12 online functions that are frequently used by users have been developed to facilitate the use of ScDB, include 'Gene Search', 'Orthologous Gene Search', 'Synteny Block', 'Genome Browser', 'Gene Expression', 'Co-expression Network', 'Blast', 'Primer', 'Sequence Fetch', 'Transcription Factors', 'Protein Interaction Network', 'Profile Inference' (Figure 1a). ScDB consists of a frontend web interface, a backend application server, a main database and a suite of tools for analysis and visualization. The database is an organized database into six main modules: 'Home', 'Genomics', 'Transcriptomics', 'Tools', 'Download' and 'Publication'. The homepage features an introduction to ScDB, an advanced search engine, descriptions of Saccharum species and Erianthus rufipilus, and links to various tools. The advanced search function enables users to search by gene ID, gene name, GO number and KEGG number (Figure 1b). The 'Genomics module' includes functions for 'Genome', 'Gene Search', 'Synteny Blocks' and 'Genome Browser'. The 'Genome' reveals Saccharum species and Erianthus rufipilus that have been sequenced, along with insights into their geographic distribution and evolutionary ties. Users can view detailed genomic information and images for each variety, as well as structural annotations for each chromosome. In the 'Gene Search' feature, users can look up several genes using either gene IDs or specific chromosome regions. The 'Search By Range' option includes a chromosome selection tool, making it easier for those who are less acquainted with the genome to navigate. The gene details page provides information on the location of genes, functional annotations, expression of various studies, Orthogroups genes, as well as CDS, proteins and upstream and downstream sequences (Figure 1c). The 'Orthologous Gene Search' module searches for homologous genes, allowing the entry of genes from species included in the ScDB, and Arabidopsis, rice and sorghum. The 'Synteny Block' can be used for a swift examination of the evolution and variety within large homologous gene segments and chromosome (Figure 1d). The 'Genome Browser' tool provides a fast and interactive genome browser for navigating large-scale high-throughput sequencing data under a genomic framework. The 'Transcriptomics module' offers search and visualization functionalities for gene expression (Figure 1e) and co-expression gene networks. In the 'Gene Expression', Users are facilitated to access expression data for a range of genes. Users have the freedom to select their preferred studies, select the expression units (either Transcripts Per Million or Fragments Per Kilobase Million), and customize the color scheme of the heatmap according to their preferences. The 'Tools' module includes functions for 'Blast', 'Primer', 'Sequence Fetch', 'Transcription Factors', 'Protein Interaction Network' and 'Profile Inference'. The 'Blast' tool performs homology searches with different data sets. 'Primer' is the primer design tool. 'Sequence Fetch' can be used to extract chromosome sequences from a specified region. In the 'Transcription Factors', we used iTAK (Zheng et al., 2016) software to identify transcription factor families and kinase families of Saccharum species and Erianthus rufipilus, users can click on the name of any transcription factor family or kinase family to view a list of all genes contained in that family and can also search for the gene family in which the gene belongs. In 'Protein Interaction Network', users can search protein interaction networks for specific genes by gene IDs. The results are presented in a table that can be saved in CSV files and also visualized as an interactive network diagram, which can also be saved as an SVG image. Users can search for motifs in the Jaspar database by matching gene ID, gene name and protein sequence in 'Profile Inference', and download meme format files that can be used for binding prediction with upstream sequences obtained from the gene details page (Figure 1f). 'Download' module provides chromosome data and annotations for download. In summary, we present ScDB, which encompasses genome assemblies, annotations and transcriptome data of six Saccharum species and Erianthus rufipilus. To enhance the usability and efficiency of data acquisition and analysis, ScDB also provides a suite of convenient modules for search, analysis and visualization. In the future, ScDB will continue to be updated, adding more sugarcane genome data and other levels of omics data (proteomics, epigenetics, ncRNA, etc.), as well as further data analysis tools to ensure that it is a powerful and sustainable sugarcane data collection and analysis platform. This work was supported by the National Key Research and Development program (2021YFF1000101 and 2021YFF1000104); This work is also supported by the National Natural Science Foundation of China (32272196). We express our gratitude to all the ScDB users for their support. The authors declare no conflicts of interest. J.Z. conceived the project; J.Z., S.C., X.L. and X.F. designed the database. S.C. and C.C. performed the coding of the website. X.F., S.C., T.H., Q.Z., C.C., J.L., Z.Z. and C.W. analysed the data. J.Z., X.F. and S.C. prepared the figures and wrote the manuscript. All authors read and approved the final manuscript. The data that support the findings of this study are openly available in Database Resource at https://sugarcane.gxu.edu.cn/scdb/download.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
CodeCraft应助lyh采纳,获得10
刚刚
1秒前
1秒前
merlin发布了新的文献求助10
2秒前
2秒前
PP发布了新的文献求助10
2秒前
3秒前
Blue发布了新的文献求助10
4秒前
5秒前
hu发布了新的文献求助20
6秒前
qiduoji完成签到 ,获得积分10
6秒前
6秒前
7秒前
搜集达人应助卑微小谢采纳,获得10
7秒前
传奇3应助luckype采纳,获得10
7秒前
7秒前
无花果应助zhiyifan采纳,获得10
7秒前
Rufina0720发布了新的文献求助10
8秒前
8秒前
lizishu应助科研通管家采纳,获得10
8秒前
8秒前
打打应助科研通管家采纳,获得10
8秒前
9秒前
9秒前
田様应助科研通管家采纳,获得30
9秒前
9秒前
科目三应助科研通管家采纳,获得10
9秒前
所所应助科研通管家采纳,获得30
9秒前
大个应助科研通管家采纳,获得10
9秒前
9秒前
文逸应助科研通管家采纳,获得10
9秒前
在水一方应助科研通管家采纳,获得10
9秒前
9秒前
充电宝应助科研通管家采纳,获得20
9秒前
9秒前
斯文败类应助科研通管家采纳,获得50
9秒前
9秒前
Owen应助科研通管家采纳,获得10
9秒前
9秒前
10秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Real Analysis: Theory of Measure and Integration (3rd Edition) Epub版 1200
AnnualResearch andConsultation Report of Panorama survey and Investment strategy onChinaIndustry 1000
卤化钙钛矿人工突触的研究 1000
Engineering for calcareous sediments : proceedings of the International Conference on Calcareous Sediments, Perth 15-18 March 1988 / edited by R.J. Jewell, D.C. Andrews 1000
Continuing Syntax 1000
Signals, Systems, and Signal Processing 610
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6260866
求助须知:如何正确求助?哪些是违规求助? 8082760
关于积分的说明 16888828
捐赠科研通 5332135
什么是DOI,文献DOI怎么找? 2838361
邀请新用户注册赠送积分活动 1815794
关于科研通互助平台的介绍 1669511