聚类分析
计算机科学
稳健性(进化)
人工智能
过度拟合
数据挖掘
特征学习
机器学习
降维
推论
图形
共识聚类
模式识别(心理学)
相关聚类
CURE数据聚类算法
人工神经网络
理论计算机科学
基因
生物化学
化学
作者
Shengwen Tian,Jiancheng Ni,Yutian Wang,Chun-Hou Zheng,Cun-Mei Ji
出处
期刊:IEEE Journal of Biomedical and Health Informatics
[Institute of Electrical and Electronics Engineers]
日期:2023-09-26
卷期号:27 (12): 6133-6143
被引量:1
标识
DOI:10.1109/jbhi.2023.3319551
摘要
Single-cell RNA sequencing (scRNA-seq) has rapidly emerged as a powerful technique for analyzing cellular heterogeneity at the individual cell level. In the analysis of scRNA-seq data, cell clustering is a critical step in downstream analysis, as it enables the identification of cell types and the discovery of novel cell subtypes. However, the characteristics of scRNA-seq data, such as high dimensionality and sparsity, dropout events and batch effects, present significant computational challenges for clustering analysis. In this study, we propose scGCC, a novel graph self-supervised contrastive learning model, to address the challenges faced in scRNA-seq data analysis. scGCC comprises two main components: a representation learning module and a clustering module. The scRNA-seq data is first fed into a representation learning module for training, which is then used for data classification through a clustering module. scGCC can learn low-dimensional denoised embeddings, which is advantageous for our clustering task. We introduce Graph Attention Networks (GAT) for cell representation learning, which enables better feature extraction and improved clustering accuracy. Additionally, we propose five data augmentation methods to improve clustering performance by increasing data diversity and reducing overfitting. These methods enhance the robustness of clustering results. Our experimental study on 14 real-world datasets has demonstrated that our model achieves extraordinary accuracy and robustness. We also perform downstream tasks, including batch effect removal, trajectory inference, and marker genes analysis, to verify the biological effectiveness of our model.
科研通智能强力驱动
Strongly Powered by AbleSci AI