基因型
RNA序列
生物
聚类分析
计算生物学
溶解
遗传学
核糖核酸
基因
转录组
基因表达
分子生物学
计算机科学
人工智能
作者
Haynes Heaton,Arthur M. Talman,Andrew Knights,Maria Imaz,Daniel J. Gaffney,Richard Durbin,Martin Hemberg,Mara Lawniczak
出处
期刊:Nature Methods
[Springer Nature]
日期:2020-05-04
卷期号:17 (6): 615-620
被引量:305
标识
DOI:10.1038/s41592-020-0820-1
摘要
Methods to deconvolve single-cell RNA-sequencing (scRNA-seq) data are necessary for samples containing a mixture of genotypes, whether they are natural or experimentally combined. Multiplexing across donors is a popular experimental design that can avoid batch effects, reduce costs and improve doublet detection. By using variants detected in scRNA-seq reads, it is possible to assign cells to their donor of origin and identify cross-genotype doublets that may have highly similar transcriptional profiles, precluding detection by transcriptional profile. More subtle cross-genotype variant contamination can be used to estimate the amount of ambient RNA. Ambient RNA is caused by cell lysis before droplet partitioning and is an important confounder of scRNA-seq analysis. Here we develop souporcell, a method to cluster cells using the genetic variants detected within the scRNA-seq reads. We show that it achieves high accuracy on genotype clustering, doublet detection and ambient RNA estimation, as demonstrated across a range of challenging scenarios. Souporcell clusters single-cell RNA-seq data using genotype information without the use of a genotype reference.
科研通智能强力驱动
Strongly Powered by AbleSci AI