聚类分析
计算机科学
数据挖掘
人工智能
编码(集合论)
源代码
领域(数学分析)
机器学习
维数之咒
辍学(神经网络)
高维数据聚类
人工神经网络
层次聚类
模式识别(心理学)
数学
数学分析
集合(抽象数据类型)
程序设计语言
操作系统
作者
Haiyun Wang,Jianping Zhao,Chun-Hou Zheng,Yansen Su
出处
期刊:Bioinformatics
[Oxford University Press]
日期:2022-06-14
卷期号:38 (15): 3703-3709
被引量:6
标识
DOI:10.1093/bioinformatics/btac393
摘要
A large number of studies have shown that clustering is a crucial step in scRNA-seq analysis. Most existing methods are based on unsupervised learning without the prior exploitation of any domain knowledge, which does not utilize available gold-standard labels. When confronted by the high dimensionality and general dropout events of scRNA-seq data, purely unsupervised clustering methods may not produce biologically interpretable clusters, which complicate cell type assignment.In this article, we propose a semi-supervised clustering method based on a capsule network named scCNC that integrates domain knowledge into the clustering step. Significantly, we also propose a Semi-supervised Greedy Iterative Training method used to train the whole network. Experiments on some real scRNA-seq datasets show that scCNC can significantly improve clustering performance and facilitate downstream analyses.The source code of scCNC is freely available at https://github.com/WHY-17/scCNC.Supplementary data are available at Bioinformatics online.
科研通智能强力驱动
Strongly Powered by AbleSci AI