已入深夜,您辛苦了!由于当前在线用户较少,发布求助请尽量完整地填写文献信息,科研通机器人24小时在线,伴您度过漫漫科研夜!祝你早点完成任务,早点休息,好梦!

Machine learning and statistical methods for clustering single-cell RNA-sequencing data

聚类分析 计算机科学 计算生物学 数据挖掘 核糖核酸 人工智能 RNA序列 生物 遗传学 转录组 基因 基因表达
作者
Raphael Petegrosso,Zhuliu Li,Rui Kuang
出处
期刊:Briefings in Bioinformatics [Oxford University Press]
卷期号:21 (4): 1209-1223 被引量:187
标识
DOI:10.1093/bib/bbz063
摘要

Single-cell RNAsequencing (scRNA-seq) technologies have enabled the large-scale whole-transcriptome profiling of each individual single cell in a cell population. A core analysis of the scRNA-seq transcriptome profiles is to cluster the single cells to reveal cell subtypes and infer cell lineages based on the relations among the cells. This article reviews the machine learning and statistical methods for clustering scRNA-seq transcriptomes developed in the past few years. The review focuses on how conventional clustering techniques such as hierarchical clustering, graph-based clustering, mixture models, $k$-means, ensemble learning, neural networks and density-based clustering are modified or customized to tackle the unique challenges in scRNA-seq data analysis, such as the dropout of low-expression genes, low and uneven read coverage of transcripts, highly variable total mRNAs from single cells and ambiguous cell markers in the presence of technical biases and irrelevant confounding biological variations. We review how cell-specific normalization, the imputation of dropouts and dimension reduction methods can be applied with new statistical or optimization strategies to improve the clustering of single cells. We will also introduce those more advanced approaches to cluster scRNA-seq transcriptomes in time series data and multiple cell populations and to detect rare cell types. Several software packages developed to support the cluster analysis of scRNA-seq data are also reviewed and experimentally compared to evaluate their performance and efficiency. Finally, we conclude with useful observations and possible future directions in scRNA-seq data analytics.All the source code and data are available at https://github.com/kuanglab/single-cell-review.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
niaaaa关注了科研通微信公众号
2秒前
Joyful发布了新的文献求助10
4秒前
4秒前
zhang完成签到 ,获得积分10
5秒前
5秒前
5秒前
FashionBoy应助Amanda采纳,获得10
6秒前
科研通AI6.3应助wang采纳,获得30
7秒前
小二郎应助危机的归尘采纳,获得10
7秒前
无情的远山完成签到,获得积分10
8秒前
think1805完成签到,获得积分10
8秒前
优秀剑愁发布了新的文献求助10
10秒前
10秒前
11秒前
FF发布了新的文献求助10
11秒前
啊哈哈哈发布了新的文献求助10
11秒前
wanci应助小明明采纳,获得10
13秒前
14秒前
15秒前
星辰大海应助QUPY采纳,获得10
15秒前
乐乐应助就这样堕落采纳,获得10
16秒前
16秒前
18秒前
666发布了新的文献求助10
18秒前
情怀应助dart1023采纳,获得10
18秒前
18秒前
悲伤猫猫头完成签到,获得积分10
19秒前
王夕月完成签到,获得积分10
19秒前
Ava应助Lee采纳,获得10
20秒前
西门戆戆发布了新的文献求助30
20秒前
NexusExplorer应助黑色风衣采纳,获得80
20秒前
Xieyusen完成签到,获得积分10
21秒前
21秒前
希望天下0贩的0应助ZB采纳,获得10
21秒前
小木屋完成签到,获得积分10
21秒前
21秒前
22秒前
23秒前
大气靳发布了新的文献求助10
23秒前
23秒前
高分求助中
Standards for Molecular Testing for Red Cell, Platelet, and Neutrophil Antigens, 7th edition 1000
HANDBOOK OF CHEMISTRY AND PHYSICS 106th edition 1000
ASPEN Adult Nutrition Support Core Curriculum, Fourth Edition 1000
Signals, Systems, and Signal Processing 610
脑电大模型与情感脑机接口研究--郑伟龙 500
GMP in Practice: Regulatory Expectations for the Pharmaceutical Industry 500
简明药物化学习题答案 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6298697
求助须知:如何正确求助?哪些是违规求助? 8115649
关于积分的说明 16990253
捐赠科研通 5360045
什么是DOI,文献DOI怎么找? 2847555
邀请新用户注册赠送积分活动 1824997
关于科研通互助平台的介绍 1679320