工作流程
计算机科学
文档
复制(统计)
标准化
缺少数据
软件
蛋白质组学
数据科学
领域(数学)
定量蛋白质组学
数据挖掘
数据库
生物
程序设计语言
机器学习
基因
操作系统
病毒学
纯数学
生物化学
数学
作者
Christophe Vanderaa,Laurent Gatto
标识
DOI:10.1080/14789450.2021.1988571
摘要
Mass spectrometry-based proteomics is actively embracing quantitative, single-cell level analyses. Indeed, recent advances in sample preparation and mass spectrometry (MS) have enabled the emergence of quantitative MS-based single-cell proteomics (SCP). While exciting and promising, SCP still has many rough edges. The current analysis workflows are custom and built from scratch. The field is therefore craving for standardized software that promotes principled and reproducible SCP data analyses.This special report is the first step toward the formalization and standardization of SCP data analysis. scp, the software that accompanies this work, successfully replicates one of the landmark SCP studies and is applicable to other experiments and designs. We created a repository containing the replicated workflow with comprehensive documentation in order to favor further dissemination and improvements of SCP data analyses.Replicating SCP data analyses uncovers important challenges in SCP data analysis. We describe two such challenges in detail: batch correction and data missingness. We provide the current state-of-the-art and illustrate the associated limitations. We also highlight the intimate dependence that exists between batch effects and data missingness and offer avenues for dealing with these exciting challenges.
科研通智能强力驱动
Strongly Powered by AbleSci AI