RNA序列
计算生物学
剪接
基因亚型
纳米孔测序
核糖核酸
转录组
生物
选择性拼接
深度测序
基因
遗传学
DNA测序
基因表达
基因组
作者
Yuan Gao,Feng Wang,Robert Wang,Eric Kutschera,Yang Xu,Stephan Xie,Yuanyuan Wang,Kathryn E. Kadash-Edmondson,Lan Lin,Yi Xing
出处
期刊:Science Advances
[American Association for the Advancement of Science (AAAS)]
日期:2023-01-20
卷期号:9 (3)
被引量:26
标识
DOI:10.1126/sciadv.abq5072
摘要
Long-read RNA sequencing (RNA-seq) holds great potential for characterizing transcriptome variation and full-length transcript isoforms, but the relatively high error rate of current long-read sequencing platforms poses a major challenge. We present ESPRESSO, a computational tool for robust discovery and quantification of transcript isoforms from error-prone long reads. ESPRESSO jointly considers alignments of all long reads aligned to a gene and uses error profiles of individual reads to improve the identification of splice junctions and the discovery of their corresponding transcript isoforms. On both a synthetic spike-in RNA sample and human RNA samples, ESPRESSO outperforms multiple contemporary tools in not only transcript isoform discovery but also transcript isoform quantification. In total, we generated and analyzed ~1.1 billion nanopore RNA-seq reads covering 30 human tissue samples and three human cell lines. ESPRESSO and its companion dataset provide a useful resource for studying the RNA repertoire of eukaryotic transcriptomes.
科研通智能强力驱动
Strongly Powered by AbleSci AI