负二项分布
主成分分析
规范化(社会学)
计算机科学
计数数据
RNA序列
计算生物学
核糖核酸
算法
数据挖掘
基因
生物
数学
人工智能
统计
转录组
遗传学
基因表达
泊松分布
社会学
人类学
作者
Davide Risso,Fanny Perraudeau,Svetlana Gribkova,Sandrine Dudoit,Jean‐Philippe Vert
标识
DOI:10.1038/s41467-017-02554-5
摘要
Abstract Single-cell RNA-sequencing (scRNA-seq) is a powerful high-throughput technique that enables researchers to measure genome-wide transcription levels at the resolution of single cells. Because of the low amount of RNA present in a single cell, some genes may fail to be detected even though they are expressed; these genes are usually referred to as dropouts. Here, we present a general and flexible zero-inflated negative binomial model (ZINB-WaVE), which leads to low-dimensional representations of the data that account for zero inflation (dropouts), over-dispersion, and the count nature of the data. We demonstrate, with simulated and real data, that the model and its associated estimation procedure are able to give a more stable and accurate low-dimensional representation of the data than principal component analysis (PCA) and zero-inflated factor analysis (ZIFA), without the need for a preliminary normalization step.
科研通智能强力驱动
Strongly Powered by AbleSci AI