ScDB: A comprehensive database dedicated to Saccharum, facilitating functional genomics and molecular biology studies in sugarcane

糖精 多倍体 糖精 生物 基因组 基因组学 倍性 生物技术 遗传学 基因 植物
作者
Siyuan Chen,Xiaoxi Feng,Qian Zhang,Xiuting Hua,Qian Zhang,Chengjie Chen,Jiawei Li,Jing Wang,Chengyin Weng,Baoshan Chen,Muqing Zhang,Wei Yao,Haibao Tang,Ray Ming,Jisen Zhang
出处
期刊:Plant Biotechnology Journal [Wiley]
标识
DOI:10.1111/pbi.14457
摘要

Sugarcane is the world's important sugar crop, serving as the primary feedstock for the production of sugar and biofuels. Modern sugarcane cultivar resulting from deliberate interspecific hybridization between Saccharum officinarum and Saccharum spontaneum. The utilization of wild resources is essential for the development of high-quality sugarcane varieties, and the genomic and omics analyses of these materials provide valuable insights into their molecular mechanisms. However, the complexity of the sugarcane genome has historically presented challenges for researchers. In our previous studies, we led the efforts to assemble the genome of a haploid S. spontaneum AP85-441 (Zhang et al., 2018) and pioneered the approach to tackle a complex autopolyploid at allele-level resolution. We then traced the origins of Saccharum and mapped the chromosomal evolution in S. spontaneum Np-X (Zhang et al., 2022). Additionally, we successfully assembled a complete, gap-free diploid Erianthus rufipilus YN2009-3 genome, shedding light on the genomic footprints of evolution in the highly polyploid Saccharum (Wang et al., 2023). Meanwhile, we are proud to present the genome of Saccharum hybrid XTT22, considered the most significant achievement in sugarcane research. Our work is currently accepted and will soon be online (Zhang et al., Nature Genetics). In addition, other teams have similarly worked on genome research in the Sugarcane. This year, the genomes of modern sugarcane R570 and ZZ1 were published by A. D'Hont's team and Muqing Zhang's team, respectively (Bao et al., 2024; Healey et al., 2024). Building upon this foundation, we are pleased to introduce ScDB (Saccharum genomic database, https://sugarcane.gxu.edu.cn/scdb), the first user-friendly multi-omics database for six Saccharum species (AP85-441, Np-X, LA-Purple, XTT22, R570, ZZ1) and a Erianthus rufipilus (YN2009-3). ScDB currently comprises a total of 38.91 Gb of genomic assembly sequences, encompassing 1 366 608 genes. Additionally, ScDB includes 24 transcriptome projects involving over 300 sugarcane samples and approximately 2.5 TB of data. Furthermore, 12 online functions that are frequently used by users have been developed to facilitate the use of ScDB, include 'Gene Search', 'Orthologous Gene Search', 'Synteny Block', 'Genome Browser', 'Gene Expression', 'Co-expression Network', 'Blast', 'Primer', 'Sequence Fetch', 'Transcription Factors', 'Protein Interaction Network', 'Profile Inference' (Figure 1a). ScDB consists of a frontend web interface, a backend application server, a main database and a suite of tools for analysis and visualization. The database is an organized database into six main modules: 'Home', 'Genomics', 'Transcriptomics', 'Tools', 'Download' and 'Publication'. The homepage features an introduction to ScDB, an advanced search engine, descriptions of Saccharum species and Erianthus rufipilus, and links to various tools. The advanced search function enables users to search by gene ID, gene name, GO number and KEGG number (Figure 1b). The 'Genomics module' includes functions for 'Genome', 'Gene Search', 'Synteny Blocks' and 'Genome Browser'. The 'Genome' reveals Saccharum species and Erianthus rufipilus that have been sequenced, along with insights into their geographic distribution and evolutionary ties. Users can view detailed genomic information and images for each variety, as well as structural annotations for each chromosome. In the 'Gene Search' feature, users can look up several genes using either gene IDs or specific chromosome regions. The 'Search By Range' option includes a chromosome selection tool, making it easier for those who are less acquainted with the genome to navigate. The gene details page provides information on the location of genes, functional annotations, expression of various studies, Orthogroups genes, as well as CDS, proteins and upstream and downstream sequences (Figure 1c). The 'Orthologous Gene Search' module searches for homologous genes, allowing the entry of genes from species included in the ScDB, and Arabidopsis, rice and sorghum. The 'Synteny Block' can be used for a swift examination of the evolution and variety within large homologous gene segments and chromosome (Figure 1d). The 'Genome Browser' tool provides a fast and interactive genome browser for navigating large-scale high-throughput sequencing data under a genomic framework. The 'Transcriptomics module' offers search and visualization functionalities for gene expression (Figure 1e) and co-expression gene networks. In the 'Gene Expression', Users are facilitated to access expression data for a range of genes. Users have the freedom to select their preferred studies, select the expression units (either Transcripts Per Million or Fragments Per Kilobase Million), and customize the color scheme of the heatmap according to their preferences. The 'Tools' module includes functions for 'Blast', 'Primer', 'Sequence Fetch', 'Transcription Factors', 'Protein Interaction Network' and 'Profile Inference'. The 'Blast' tool performs homology searches with different data sets. 'Primer' is the primer design tool. 'Sequence Fetch' can be used to extract chromosome sequences from a specified region. In the 'Transcription Factors', we used iTAK (Zheng et al., 2016) software to identify transcription factor families and kinase families of Saccharum species and Erianthus rufipilus, users can click on the name of any transcription factor family or kinase family to view a list of all genes contained in that family and can also search for the gene family in which the gene belongs. In 'Protein Interaction Network', users can search protein interaction networks for specific genes by gene IDs. The results are presented in a table that can be saved in CSV files and also visualized as an interactive network diagram, which can also be saved as an SVG image. Users can search for motifs in the Jaspar database by matching gene ID, gene name and protein sequence in 'Profile Inference', and download meme format files that can be used for binding prediction with upstream sequences obtained from the gene details page (Figure 1f). 'Download' module provides chromosome data and annotations for download. In summary, we present ScDB, which encompasses genome assemblies, annotations and transcriptome data of six Saccharum species and Erianthus rufipilus. To enhance the usability and efficiency of data acquisition and analysis, ScDB also provides a suite of convenient modules for search, analysis and visualization. In the future, ScDB will continue to be updated, adding more sugarcane genome data and other levels of omics data (proteomics, epigenetics, ncRNA, etc.), as well as further data analysis tools to ensure that it is a powerful and sustainable sugarcane data collection and analysis platform. This work was supported by the National Key Research and Development program (2021YFF1000101 and 2021YFF1000104); This work is also supported by the National Natural Science Foundation of China (32272196). We express our gratitude to all the ScDB users for their support. The authors declare no conflicts of interest. J.Z. conceived the project; J.Z., S.C., X.L. and X.F. designed the database. S.C. and C.C. performed the coding of the website. X.F., S.C., T.H., Q.Z., C.C., J.L., Z.Z. and C.W. analysed the data. J.Z., X.F. and S.C. prepared the figures and wrote the manuscript. All authors read and approved the final manuscript. The data that support the findings of this study are openly available in Database Resource at https://sugarcane.gxu.edu.cn/scdb/download.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
无限天抒关注了科研通微信公众号
1秒前
echasl73发布了新的文献求助10
1秒前
Lin完成签到,获得积分10
2秒前
www完成签到 ,获得积分10
3秒前
神木丽完成签到,获得积分20
5秒前
7秒前
CipherSage应助Sene采纳,获得10
8秒前
10秒前
12秒前
15秒前
无限天抒发布了新的文献求助30
15秒前
蔡蔡发布了新的文献求助10
17秒前
Hz123456完成签到,获得积分10
17秒前
17秒前
KK完成签到,获得积分10
19秒前
小熊完成签到,获得积分10
19秒前
1111111完成签到 ,获得积分10
19秒前
Hz123456发布了新的文献求助10
20秒前
墨阳初晴发布了新的文献求助10
20秒前
深情安青应助褪山海采纳,获得10
21秒前
着急的滑板完成签到,获得积分10
21秒前
mengdi完成签到 ,获得积分10
22秒前
kiki完成签到,获得积分10
22秒前
橙子完成签到,获得积分10
22秒前
不安青牛应助小熊采纳,获得10
22秒前
23秒前
南大研究生完成签到 ,获得积分10
25秒前
lareina完成签到 ,获得积分10
25秒前
罗大大完成签到 ,获得积分10
26秒前
秋殇浅寞完成签到,获得积分10
27秒前
香蕉觅云应助科研通管家采纳,获得10
27秒前
BitBong完成签到,获得积分10
27秒前
李爱国应助科研通管家采纳,获得10
27秒前
彭于晏应助科研通管家采纳,获得10
27秒前
所所应助科研通管家采纳,获得10
27秒前
Lucas应助科研通管家采纳,获得10
28秒前
28秒前
桐桐应助科研通管家采纳,获得10
28秒前
28秒前
Lucas应助科研通管家采纳,获得10
28秒前
高分求助中
Impact of Mitophagy-Related Genes on the Diagnosis and Development of Esophageal Squamous Cell Carcinoma via Single-Cell RNA-seq Analysis and Machine Learning Algorithms 2000
How to Create Beauty: De Lairesse on the Theory and Practice of Making Art 1000
Gerard de Lairesse : an artist between stage and studio 670
大平正芳: 「戦後保守」とは何か 550
2019第三届中国LNG储运技术交流大会论文集 500
Contributo alla conoscenza del bifenile e dei suoi derivati. Nota XV. Passaggio dal sistema bifenilico a quello fluorenico 500
Multiscale Thermo-Hydro-Mechanics of Frozen Soil: Numerical Frameworks and Constitutive Models 500
热门求助领域 (近24小时)
化学 医学 生物 材料科学 工程类 有机化学 生物化学 物理 内科学 纳米技术 计算机科学 化学工程 复合材料 基因 遗传学 催化作用 物理化学 免疫学 量子力学 细胞生物学
热门帖子
关注 科研通微信公众号,转发送积分 2998259
求助须知:如何正确求助?哪些是违规求助? 2658819
关于积分的说明 7197938
捐赠科研通 2294325
什么是DOI,文献DOI怎么找? 1216550
科研通“疑难数据库(出版商)”最低求助积分说明 593547
版权声明 592904