聚类分析
层次聚类
度量(数据仓库)
计算机科学
鉴定(生物学)
共识聚类
距离测量
完整的链接聚类
单连锁聚类
相关聚类
人工智能
数据挖掘
模式识别(心理学)
生物
CURE数据聚类算法
植物
作者
Chunxiang Wang,Mu Zhang,Chaozhou Mou,Hongyu Zheng,Juntao Liu
摘要
The development of single-cell ribonucleic acid (RNA) sequencing (scRNA-seq) technology has led to great opportunities for the identification of heterogeneous cell types in complex tissues. Clustering algorithms are of great importance to effectively identify different cell types. In addition, the definition of the distance between each two cells is a critical step for most clustering algorithms. In this study, we found that different distance measures have considerably different effects on clustering algorithms. Moreover, there is no specific distance measure that is applicable to all datasets. In this study, we introduce a new single-cell clustering method called SD-h, which generates an applicable distance measure for different kinds of datasets by optimally synthesizing commonly used distance measures. Then, hierarchical clustering is performed based on the new distance measure for more accurate cell-type clustering. SD-h was tested on nine frequently used scRNA-seq datasets and it showed great superiority over almost all the compared leading single-cell clustering algorithms.
科研通智能强力驱动
Strongly Powered by AbleSci AI