计算机科学
生成模型
注释
表观遗传学
染色质
人工智能
计算生物学
概率逻辑
机器学习
生物
生成语法
遗传学
基因
基因表达
DNA甲基化
作者
Xiaoyang Chen,Shengquan Chen,Shuang Song,Zijing Gao,Lin Hou,Xuegong Zhang,Hairong Lv,Rui Jiang
标识
DOI:10.1038/s42256-021-00432-w
摘要
Recent advances in single-cell technologies have enabled the characterization of epigenomic heterogeneity at the cellular level. Computational methods for automatic cell type annotation are urgently needed given the exponential growth in the number of cells. In particular, annotation of single-cell chromatin accessibility sequencing (scCAS) data, which can capture the chromatin regulatory landscape that governs transcription in each cell type, has not been fully investigated. Here we propose EpiAnno, a probabilistic generative model integrated with a Bayesian neural network, to annotate scCAS data automatically in a supervised manner. We systematically validate the superior performance of EpiAnno for both intra- and inter-dataset annotation on various datasets. We further demonstrate the advantages of EpiAnno for interpretable embedding and biological implications via expression enrichment analysis, partitioned heritability analysis, enhancer identification, cis-coaccessibility analysis and pathway enrichment analysis. In addition, we show that EpiAnno has the potential to reveal cell type-specific motifs and facilitate scCAS data simulation.
科研通智能强力驱动
Strongly Powered by AbleSci AI