符号
聚类分析
图形
计算机科学
非负矩阵分解
组合数学
数学
人工智能
矩阵分解
特征向量
算术
物理
量子力学
作者
Han-Jing Jiang,Mei-Neng Wang,Yu‐An Huang,Yabing Huang
出处
期刊:IEEE Journal of Biomedical and Health Informatics
[Institute of Electrical and Electronics Engineers]
日期:2024-05-24
卷期号:28 (8): 4986-4994
标识
DOI:10.1109/jbhi.2024.3400050
摘要
The advent of single-cell RNA sequencing (scRNA-seq) has brought forth fresh perspectives on intricate biological processes, revealing the nuances and divergences present among distinct cells. Accurate single-cell analysis is a crucial prerequisite for in-depth investigation into the underlying mechanisms of heterogeneity. Due to various technical noises, like the impact of dropout values, scRNA-seq data remains challenging to interpret. In this work, we propose an unsupervised learning framework for scRNA-seq data analysis (aka Sc-GNNMF). Based on the non-negativity and sparsity of scRNA-seq data, we propose employing graph-regularized non-negative matrix factorization (GNNMF) algorithm for the analysis of scRNA-seq data, which involves estimating cell-cell similarity and gene-gene similarity through Laplacian kernels and $p$ -nearest neighbor graphs ( $p$ -NNG). By assuming intrinsic geometric local invariance, we use a weighted $p$ -nearest known neighbors ( $p$ -NKN) of cell-cell interactions to guide the matrix decomposition process, promoting the closeness of cells with similar types in cell-gene data space and determining a more suitable embedding space for clustering. Sc-GNNMF demonstrates superior performance compared to other methods and maintains satisfactory compatibility and robustness, as evidenced by experiments on 11 real scRNA-seq datasets. Furthermore, Sc-GNNMF yields excellent results in clustering tasks, extracting useful gene markers, and pseudo-temporal analysis.
科研通智能强力驱动
Strongly Powered by AbleSci AI