聚类分析
空间分析
人工智能
计算机科学
信息瓶颈法
背景(考古学)
自编码
瓶颈
模式识别(心理学)
特征(语言学)
共识聚类
数据挖掘
深度学习
生物
相关聚类
CURE数据聚类算法
数学
哲学
嵌入式系统
古生物学
统计
语言学
作者
Xiang Lin,Le Gao,Nathan P. Whitener,Asraa Ahmed,Zhi Wei
标识
DOI:10.1101/gr.276477.121
摘要
Spatially resolved scRNA-seq (sp-scRNA-seq) technologies provide the potential to comprehensively profile gene expression patterns in tissue context. However, the development of computational methods lags behind the advances in these technologies, which limits the fulfillment of their potential. In this study, we develop a deep learning approach for clustering sp-scRNA-seq data, named Deep Spatially constrained Single-cell Clustering (DSSC). In this model, we integrate the spatial information of cells into the clustering process in two steps: (1) the spatial information is encoded by using a graphical neural network model, and (2) cell-to-cell constraints are built based on the spatial expression pattern of the marker genes and added in the model to guide the clustering process. Then, a deep embedding clustering is performed on the bottleneck layer of autoencoder by Kullback–Leibler (KL) divergence along with the learning of feature representation. DSSC is the first model that can use information from both spatial coordinates and marker genes to guide cell/spot clustering. Extensive experiments on both simulated and real data sets show that DSSC boosts clustering performance significantly compared with the state-of-the-art methods. It has robust performance across different data sets with various cell type/tissue organization and/or cell type/tissue spatial dependency. We conclude that DSSC is a promising tool for clustering sp-scRNA-seq data.
科研通智能强力驱动
Strongly Powered by AbleSci AI