基因亚型
选择性拼接
RNA剪接
RNA序列
生物
串联(数学)
计算生物学
核糖核酸
基因
内含子
遗传学
转录组
基因表达
互补DNA
数学
组合数学
作者
Aziz Al’Khafaji,Jonathan T. Smith,Kiran Garimella,Mehrtash Babadi,Moshe Sade-Feldman,Michael Gatzen,Siranush Sarkizova,Marc A. Schwartz,Victoria Popic,Emily M. Blaum,Allyson Day,Maura Costello,Tera Bowers,Stacey Gabriel,Eric Banks,Anthony Philippakis,Genevieve M. Boland,Paul C. Blainey,Nir Hacohen
标识
DOI:10.1101/2021.10.01.462818
摘要
Abstract Alternative splicing is a core biological process that enables profound and essential diversification of gene function. Short-read RNA sequencing approaches fail to resolve RNA isoforms and therefore primarily enable gene expression measurements - an isoform unaware representation of the transcriptome. Conversely, full-length RNA sequencing using long-read technologies are able to capture complete transcript isoforms, but their utility is deeply constrained due to throughput limitations. Here, we introduce MAS-ISO-seq, a technique for programmably concatenating cDNAs into single molecules optimal for long-read sequencing, boosting the throughput >15 fold to nearly 40 million cDNA reads per run on the Sequel IIe sequencer. We validated unambiguous isoform assignment with MAS-ISO-seq using a synthetic RNA isoform library and applied this approach to single-cell RNA sequencing of tumor-infiltrating T cells. Results demonstrated a >30 fold boosted discovery of differentially spliced genes and robust cell clustering, as well as canonical PTPRC splicing patterns across T cell subpopulations and the concerted expression of the associated hnRNPLL splicing factor. Methods such as MAS-ISO-seq will drive discovery of novel isoforms and the transition from gene expression to transcript isoform expression analyses.
科研通智能强力驱动
Strongly Powered by AbleSci AI