化学
标杆管理
数据集成
水准点(测量)
数据挖掘
样品(材料)
实验数据
统计
计算机科学
大地测量学
数学
色谱法
业务
营销
地理
作者
Yaguo Gong,Yuntao Dai,Qibiao Wu,Li Guo,Xiaojun Yao,Qingxia Yang
标识
DOI:10.1021/acs.analchem.4c04933
摘要
Single-cell proteomics (SCP) detected based on different technologies always involves batch-specific variations because of differences in sample processing and other potential biases. How to integrate SCP data effectively has become a great challenge. Integration of SCP data not only requires the conservation of true biological variances, but also realizes the removal of unwanted batch effects. In this study, benchmarking analysis of popular data integration methods was conducted to determine the most suitable method for SCP data. To comprehensively evaluate the performance of these integration methods, a novel evaluation system was proposed for integrating SCP data. This evaluation system consists of three objective measures from different perspectives: category (a), the efficacy of correcting batch effects; category (b), the power of conserving biological variances; and category (c), the ability to identify consistent markers. For this comprehensive evaluation, five benchmark data sets under different scenarios (containing substantial proteins, substantial cells, multiple batches, multiple cell types, and unbalanced data) were utilized for selecting the most suitable data integration method. As a result, three methods, ComBat, Scanorama, and Seurat version 3 CCA, were identified as the most recommended methods for integrating SCP data. Overall, this systematic evaluation might provide valuable guidance in choosing the appropriate method for data integration in the SCP.
科研通智能强力驱动
Strongly Powered by AbleSci AI