RNA剪接
标杆管理
计算机科学
计算生物学
事件(粒子物理)
选择性拼接
鉴定(生物学)
数据挖掘
生物
核糖核酸
基因
遗传学
信使核糖核酸
物理
植物
营销
量子力学
业务
作者
Minghao Jiang,Shi‐Yan Zhang,Hongxin Yin,Zhiyi Zhuo,Guoyu Meng
摘要
Abstract RNA alternative splicing, a post-transcriptional stage in eukaryotes, is crucial in cellular homeostasis and disease processes. Due to the rapid development of the next-generation sequencing (NGS) technology and the flood of NGS data, the detection of differential splicing from RNA-seq data has become mainstream. A range of bioinformatic tools has been developed. However, until now, an independent and comprehensive comparison of available algorithms/tools at the event level is still lacking. Here, 21 different tools are subjected to systematic evaluation, based on simulated RNA-seq data where exact differential splicing events are introduced. We observe immense discrepancies among these tools. SUPPA, DARTS, rMATS and LeafCutter outperforme other event-based tools. We also examine the abilities of the tools to identify novel splicing events, which shows that most event-based tools are unsuitable for discovering novel splice sites. To improve the overall performance, we present two methodological approaches i.e. low-expression transcript filtering and tool-pair combination. Finally, a new protocol of selecting tools to perform differential splicing analysis for different analytical tasks (e.g. precision and recall rate) is proposed. Under this protocol, we analyze the distinct splicing landscape in the DUX4/IGH subgroup of B-cell acute lymphoblastic leukemia and uncover the differential splicing of TCF12. All codes needed to reproduce the results are available at https://github.com/mhjiang97/Benchmarking_DS.
科研通智能强力驱动
Strongly Powered by AbleSci AI