生物
计算生物学
胚胎
单元格排序
核糖核酸
分类
基因
熵(时间箭头)
基因表达
细胞
遗传学
模式识别(心理学)
人工智能
计算机科学
算法
物理
量子力学
作者
Arthur Radley,Elena Corujo-Simon,Jennifer Nichols,Austin Smith,Sara-Jane Dunn
标识
DOI:10.1016/j.stemcr.2022.09.007
摘要
Summary
A major challenge in single-cell gene expression analysis is to discern meaningful cellular heterogeneity from technical or biological noise. To address this challenge, we present entropy sorting (ES), a mathematical framework that distinguishes genes indicative of cell identity. ES achieves this in an unsupervised manner by quantifying if observed correlations between features are more likely to have occurred due to random chance versus a dependent relationship, without the need for any user-defined significance threshold. On synthetic data, we demonstrate the removal of noisy signals to reveal a higher resolution of gene expression patterns than commonly used feature selection methods. We then apply ES to human pre-implantation embryo single-cell RNA sequencing (scRNA-seq) data. Previous studies failed to unambiguously identify early inner cell mass (ICM), suggesting that the human embryo may diverge from the mouse paradigm. In contrast, ES resolves the ICM and reveals sequential lineage bifurcations as in the classical model. ES thus provides a powerful approach for maximizing information extraction from high-dimensional datasets such as scRNA-seq data.
科研通智能强力驱动
Strongly Powered by AbleSci AI