计算机科学
生物导体
可扩展性
可视化
计算生物学
协议(科学)
概化理论
数据挖掘
网格单元
数据可视化
生物
网格
基因
数据库
遗传学
统计
医学
病理
替代医学
数学
几何学
作者
Stephan Fischer,Megan Crow,Benjamin D. Harris,Jesse Gillis
出处
期刊:Nature Protocols
[Springer Nature]
日期:2021-07-07
卷期号:16 (8): 4031-4067
被引量:28
标识
DOI:10.1038/s41596-021-00575-5
摘要
Single-cell RNA-sequencing data have significantly advanced the characterization of cell-type diversity and composition. However, cell-type definitions vary across data and analysis pipelines, raising concerns about cell-type validity and generalizability. With MetaNeighbor, we proposed an efficient and robust quantification of cell-type replicability that preserves dataset independence and is highly scalable compared to dataset integration. In this protocol, we show how MetaNeighbor can be used to characterize cell-type replicability by following a simple three-step procedure: gene filtering, neighbor voting and visualization. We show how these steps can be tailored to quantify cell-type replicability, determine gene sets that contribute to cell-type identity and pretrain a model on a reference taxonomy to rapidly assess newly generated data. The protocol is based on an open-source R package available from Bioconductor and GitHub, requires basic familiarity with Rstudio or the R command line and can typically be run in <5 min for millions of cells.
科研通智能强力驱动
Strongly Powered by AbleSci AI