细胞
单细胞分析
电池类型
诱导多能干细胞
可扩展性
计算生物学
计算机科学
核糖核酸
细胞培养
管道(软件)
生物
人工智能
遗传学
基因
胚胎干细胞
数据库
程序设计语言
作者
Jesus Gonzalez-Ferrer,Julian Lehrer,Ash O’Farrell,Benedict Paten,Mircea Teodorescu,David Haussler,Vanessa D. Jönsson,Mohammed A. Mostajo-Radji
标识
DOI:10.1016/j.xgen.2024.100581
摘要
Cell atlases serve as vital references for automating cell labeling in new samples, yet existing classification algorithms struggle with accuracy. Here we introduce SIMS (scalable, interpretable machine learning for single cell), a low-code data-efficient pipeline for single-cell RNA classification. We benchmark SIMS against datasets from different tissues and species. We demonstrate SIMS's efficacy in classifying cells in the brain, achieving high accuracy even with small training sets (<3,500 cells) and across different samples. SIMS accurately predicts neuronal subtypes in the developing brain, shedding light on genetic changes during neuronal differentiation and postmitotic fate refinement. Finally, we apply SIMS to single-cell RNA datasets of cortical organoids to predict cell identities and uncover genetic variations between cell lines. SIMS identifies cell-line differences and misannotated cell lineages in human cortical organoids derived from different pluripotent stem cell lines. Altogether, we show that SIMS is a versatile and robust tool for cell-type classification from single-cell datasets.
科研通智能强力驱动
Strongly Powered by AbleSci AI