聚类分析
计算机科学
源代码
卷积(计算机科学)
图形
维数之咒
编码(集合论)
数据挖掘
人工智能
模式识别(心理学)
理论计算机科学
人工神经网络
操作系统
集合(抽象数据类型)
程序设计语言
作者
Shudong Wang,Y. Zhang,Yulin Zhang,Wenhao Wu,Lan Ye,Yunyin Li,Jionglong Su,Shanchen Pang
标识
DOI:10.1016/j.compbiomed.2023.107152
摘要
Single-cell RNA sequencing (scRNA-seq) is now a successful technique for identifying cellular heterogeneity, revealing novel cell subpopulations, and forecasting developmental trajectories. A crucial component of the processing of scRNA-seq data is the precise identification of cell subpopulations. Although many unsupervised clustering methods have been developed to cluster cell subpopulations, the performance of these methods is vulnerable to dropouts and high dimensionality. In addition, most existing methods are time-consuming and fail to adequately account for potential associations between cells. In the manuscript, we present an unsupervised clustering method based on an adaptive simplified graph convolution model called scASGC. The proposed method builds plausible cell graphs, aggregates neighbor information using a simplified graph convolution model, and adaptively determines the most optimal number of convolution layers for various graphs. Experiments on 12 public datasets show that scASGC outperforms both classical and state-of-the-art clustering methods. In addition, in a study of mouse intestinal muscle containing 15,983 cells, we identified distinct marker genes based on the clustering results of scASGC. The source code of scASGC is available at https://github.com/ZzzOctopus/scASGC.
科研通智能强力驱动
Strongly Powered by AbleSci AI