自编码
计算机科学
特征(语言学)
人工智能
图形
相似性(几何)
降维
数据挖掘
模式识别(心理学)
机器学习
深度学习
理论计算机科学
语言学
图像(数学)
哲学
作者
Dai-Jun Zhang,Ying-Lian Gao,Jing-Xiu Zhao,Chun-Hou Zheng,Jin‐Xing Liu
标识
DOI:10.1109/tnnls.2022.3190289
摘要
Single-cell RNA sequencing (scRNA-seq) technology is famous for providing a microscopic view to help capture cellular heterogeneity. This characteristic has advanced the field of genomics by enabling the delicate differentiation of cell types. However, the properties of single-cell datasets, such as high dropout events, noise, and high dimensionality, are still a research challenge in the single-cell field. To utilize single-cell data more efficiently and to better explore the heterogeneity among cells, a new graph autoencoder (GAE)-based consensus-guided model (scGAC) is proposed in this article. The data are preprocessed into multiple top-level feature datasets. Then, feature learning is performed by using GAEs to generate new feature matrices, followed by similarity learning based on distance fusion methods. The learned similarity matrices are fed back to the GAEs to guide their feature learning process. Finally, the abovementioned steps are iterated continuously to integrate the final consistent similarity matrix and perform other related downstream analyses. The scGAC model can accurately identify critical features and effectively preserve the internal structure of the data. This can further improve the accuracy of cell type identification.
科研通智能强力驱动
Strongly Powered by AbleSci AI