聚类分析
离群值
稳健性(进化)
计算机科学
可视化
数据挖掘
子空间拓扑
代表(政治)
噪音(视频)
秩(图论)
模式识别(心理学)
人工智能
数学
图像(数学)
生物化学
政治
基因
组合数学
化学
法学
政治学
作者
Cui-Na Jiao,Jin‐Xing Liu,Juan Wang,Junliang Shang,Chun-Hou Zheng
出处
期刊:IEEE Journal of Biomedical and Health Informatics
[Institute of Electrical and Electronics Engineers]
日期:2022-04-01
卷期号:26 (4): 1872-1882
被引量:3
标识
DOI:10.1109/jbhi.2021.3110766
摘要
The exploration of single cell RNA-sequencing (scRNA-seq) technology generates a new perspective to analyze biological problems. One of the major applications of scRNA-seq data is to discover subtypes of cells by cell clustering. Nevertheless, it is challengeable for traditional methods to handle scRNA-seq data with high level of technical noise and notorious dropouts. To better analyze single cell data, a novel scRNA-seq data analysis model called Maximum correntropy criterion based Non-negative and Low Rank Representation (MccNLRR) is introduced. Specifically, the maximum correntropy criterion, as an effective loss function, is more robust to the high noise and large outliers existed in the data. Moreover, the low rank representation is proven to be a powerful tool for capturing the global and local structures of data. Therefore, some important information, such as the similarity of cells in the subspace, is also extracted by it. Then, an iterative algorithm on the basis of the half-quadratic optimization and alternating direction method is developed to settle the complex optimization problem. Before the experiment, we also analyze the convergence and robustness of MccNLRR. At last, the results of cell clustering, visualization analysis, and gene markers selection on scRNA-seq data reveal that MccNLRR method can distinguish cell subtypes accurately and robustly.
科研通智能强力驱动
Strongly Powered by AbleSci AI