自编码
降维
计算机科学
降噪
聚类分析
人工智能
维数(图论)
还原(数学)
模式识别(心理学)
高维数据聚类
维数之咒
噪音(视频)
数据挖掘
人工神经网络
数学
图像(数学)
几何学
纯数学
作者
Xiaoshu Zhu,Yongchang Lin,Jian Li,Jianxin Wang,Xiaoqing Peng
标识
DOI:10.1007/978-3-030-91415-8_45
摘要
Single-cell RNA-seq (scRNA-seq) data has provided a higher resolution of cellular heterogeneity. However, scRNA-seq data also brings some computational challenges for its high-dimension, high-noise, and high-sparseness. The dimension reduction is a crucial way to denoise and greatly reduce the computational complexity by representing the original data in a low-dimensional space. In this study, to achieve an accurate low-dimension representation, we proposed a denoising AutoEncoder based dimensionality reduction method for scRNA-seq data (ScDA), combining the denoising function with the AutoEncoder. ScDA is a deep unsupervised generative model, which models the dropout events and denoises the scRNA-seq data. Meanwhile, ScDA can reveal the nonlinear feature extraction of the original data through maximum distribution similarity before and after dimensionality reduction. Tested on 16 scRNA-seq datasets, ScDA provides superior average performances, and especially superior performances in large-scale datasets compared with 3 clustering methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI