背景(考古学)
计算生物学
后转座子
RNA序列
生物
转录组
计算机科学
基因
人工智能
遗传学
基因表达
基因组
转座因子
古生物学
作者
Ying Chen,Andre Sim,Yuk Kei Wan,Keith Yeo,Joseph Lee,Min Hao Ling,Michael I. Love,Jonathan Göke
出处
期刊:Nature Methods
[Springer Nature]
日期:2023-06-12
卷期号:20 (8): 1187-1195
被引量:48
标识
DOI:10.1038/s41592-023-01908-w
摘要
Most approaches to transcript quantification rely on fixed reference annotations; however, the transcriptome is dynamic and depending on the context, such static annotations contain inactive isoforms for some genes, whereas they are incomplete for others. Here we present Bambu, a method that performs machine-learning-based transcript discovery to enable quantification specific to the context of interest using long-read RNA-sequencing. To identify novel transcripts, Bambu estimates the novel discovery rate, which replaces arbitrary per-sample thresholds with a single, interpretable, precision-calibrated parameter. Bambu retains the full-length and unique read counts, enabling accurate quantification in presence of inactive isoforms. Compared to existing methods for transcript discovery, Bambu achieves greater precision without sacrificing sensitivity. We show that context-aware annotations improve quantification for both novel and known transcripts. We apply Bambu to quantify isoforms from repetitive HERVH-LTR7 retrotransposons in human embryonic stem cells, demonstrating the ability for context-specific transcript expression analysis. Leveraging long-read RNA-seq data and machine learning, Bambu facilitates accurate transcript discovery and quantification.
科研通智能强力驱动
Strongly Powered by AbleSci AI