生物
计算生物学
遗传学
DNA测序
基因
基因组
顺序装配
核糖核酸
康蒂格
进化生物学
转录组
作者
Tianpeng Chang,Bingxing An,Mang Liang,Xinghai Duan,Lili Du,Wentao Cai,Bo Zhu,Xue Gao,Yan Chen,Lingyang Xu,Lupei Zhang,Huijiang Gao,Junya Li
标识
DOI:10.3389/fgene.2021.664974
摘要
Cattle (Bos taurus) is one of the most widely distributed livestock species in the world, and provides us with high-quality milk and meat which have a huge impact on the quality of human life. Therefore, accurate and complete transcriptome and genome annotation are of great value to the research of cattle breeding. In this study, we used error-corrected PacBio single-molecule real-time (SMRT) data to perform whole-transcriptome profiling in cattle. Then, 22.5 Gb of subreads was generated, including 381,423 circular consensus sequences (CCSs), among which 276,295 full-length non-chimeric (FLNC) sequences were identified. After correction by Illumina short reads, we obtained 22,353 error-corrected isoforms. A total of 305 alternative splicing (AS) events and 3,795 alternative polyadenylation (APA) sites were detected by transcriptome structural analysis. Furthermore, we identified 457 novel genes, 120 putative transcription factors (TFs), and 569 novel long non-coding RNAs (lncRNAs). Taken together, this research improves our understanding and provides new insights into the complexity of full-length transcripts in cattle.
科研通智能强力驱动
Strongly Powered by AbleSci AI