TransIntegrator: capture nearly full protein-coding transcript variants via integrating Illumina and PacBio transcriptomes

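Transcriptome, Genome, Gene, Biology, De novo transcriptome assembly, Computational biology, Illumina dye sequencing, Genetics, RNA-Seq, DNA sequencing, Gene expression
Authors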
Liu Zhe,Yangmei Qin,Hao Chen,Dan Shi,Mindong Zhong,Te An,Linshan Chen,Yiquan Wang,Fan Lin,Guang Li,Zhi-Liang Ji
Source
Journal: Briefings in Bioinformatics [Oxford University Press]
Volume (Issue): 24 (6)
Identifier
DOI:10.1093/bib/bbad334
Abstract

Genes can produce transcript variants that perform specific cellular functions. However, accurately detecting all transcript variants remains a long-standing challenge, especially when working with poorly annotated genomes or without a known genome. To address this issue, we have developed a new computational method, TransIntegrator, which enables transcriptome-wide detection of novel transcript variants. To this end, we sequenced 10 Illumina transcriptomes and one PacBio full-length transcriptome covering consecutive embryonic developmental stages of amphioxus, a species of great evolutionary importance. Based on these transcriptomes, we employed TransIntegrator to create a comprehensive transcript variant library, namely iTranscriptome. The resulting iTranscriptome contained 91 915 distinct transcript variants, with an average of 2.4 variants per gene. This substantially improved the current amphioxus genome annotation by expanding the number of genes from 21 954 to 38 777. Further analysis showed that the gene expansion was largely attributable to the integration of multiple Illumina datasets rather than to the inclusion of the PacBio data. Moreover, we demonstrated an example application of TransIntegrator in which the generated iTranscriptome aided accurate transcriptome assembly, significantly outperforming other hybrid methods such as IDP-denovo and Trinity. For user convenience, we have deposited the source code of TransIntegrator on GitHub and released it as a conda package on Anaconda. In summary, this study proposes an affordable yet efficient method for reliable transcriptomic research in most species.
