计算机科学
可扩展性
鉴定(生物学)
机器学习
人工智能
数据挖掘
概率逻辑
生物
植物
数据库
作者
Tal Ashuach,Daniel A. Reidenbach,Adam Gayoso,Nir Yosef
标识
DOI:10.1016/j.crmeth.2022.100182
摘要
Single-cell ATAC sequencing (scATAC-seq) is a powerful and increasingly popular technique to explore the regulatory landscape of heterogeneous cellular populations. However, the high noise levels, degree of sparsity, and scale of the generated data make its analysis challenging. Here, we present PeakVI, a probabilistic framework that leverages deep neural networks to analyze scATAC-seq data. PeakVI fits an informative latent space that preserves biological heterogeneity while correcting batch effects and accounting for technical effects, such as library size and region-specific biases. In addition, PeakVI provides a technique for identifying differential accessibility at a single-region resolution, which can be used for cell-type annotation as well as identification of key cis-regulatory elements. We use public datasets to demonstrate that PeakVI is scalable, stable, robust to low-quality data, and outperforms current analysis methods on a range of critical analysis tasks. PeakVI is publicly available and implemented in the scvi-tools framework.
科研通智能强力驱动
Strongly Powered by AbleSci AI