虚假关系
计算生物学
基因
基因表达
鉴定(生物学)
生物
核糖核酸
表达式(计算机科学)
极限(数学)
细胞
RNA序列
生物系统
计算机科学
模式识别(心理学)
遗传学
人工智能
转录组
数学
机器学习
植物
数学分析
程序设计语言
作者
Christopher S. McGinnis,Lyndsay M. Murrow,Zev J. Gartner
出处
期刊:Cell systems
[Elsevier]
日期:2019-04-01
卷期号:8 (4): 329-337.e4
被引量:2300
标识
DOI:10.1016/j.cels.2019.03.003
摘要
Single-cell RNA sequencing (scRNA-seq) data are commonly affected by technical artifacts known as "doublets," which limit cell throughput and lead to spurious biological conclusions. Here, we present a computational doublet detection tool-DoubletFinder-that identifies doublets using only gene expression data. DoubletFinder predicts doublets according to each real cell's proximity in gene expression space to artificial doublets created by averaging the transcriptional profile of randomly chosen cell pairs. We first use scRNA-seq datasets where the identity of doublets is known to show that DoubletFinder identifies doublets formed from transcriptionally distinct cells. When these doublets are removed, the identification of differentially expressed genes is enhanced. Second, we provide a method for estimating DoubletFinder input parameters, allowing its application across scRNA-seq datasets with diverse distributions of cell types. Lastly, we present "best practices" for DoubletFinder applications and illustrate that DoubletFinder is insensitive to an experimentally validated kidney cell type with "hybrid" expression features.
科研通智能强力驱动
Strongly Powered by AbleSci AI