聚类分析
计算机科学
数据挖掘
数据集成
双聚类
冗余(工程)
计算生物学
机器学习
模糊聚类
生物
CURE数据聚类算法
操作系统
作者
Liang-Rui Ren,Jun Wang,Zhao Li,Qingzhong Li,Guoxian Yu
出处
期刊:Bioinformatics
[Oxford University Press]
日期:2023-03-17
卷期号:39 (4)
被引量:10
标识
DOI:10.1093/bioinformatics/btad133
摘要
Abstract Motivation The integration of single-cell multi-omics data can uncover the underlying regulatory basis of diverse cell types and states. However, contemporary methods disregard the omics individuality, and the high noise, sparsity, and heterogeneity of single-cell data also impact the fusion effect. Furthermore, available single-cell clustering methods only focus on the cell type clustering, which cannot mine the alternative clustering to comprehensively analyze cells. Results We propose a single-cell data fusion based multiple clustering (scMCs) approach that can jointly model single-cell transcriptomics and epigenetic data, and explore multiple different clusterings. scMCs first mines the omics-specific and cross-omics consistent representations, then fuses them into a co-embedding representation, which can dissect cellular heterogeneity and impute data. To discover the potential alternative clustering embedded in multi-omics, scMCs projects the co-embedding representation into different salient subspaces. Meanwhile, it reduces the redundancy between subspaces to enhance the diversity of alternative clusterings and optimizes the cluster centers in each subspace to boost the quality of corresponding clustering. Unlike single clustering, these alternative clusterings provide additional perspectives for understanding complex genetic information, such as cell types and states. Experimental results show that scMCs can effectively identify subcellular types, impute dropout events, and uncover diverse cell characteristics by giving different but meaningful clusterings. Availability and implementation The code is available at www.sdu-idea.cn/codes.php?name=scMCs.
科研通智能强力驱动
Strongly Powered by AbleSci AI