Benchmarking algorithms for joint integration of unpaired and paired single-cell RNA-seq and ATAC-seq data

模态(人机交互) 染色质 计算机科学 注释 数据类型 计算生物学 RNA序列 标杆管理 模式 水准点(测量) 数据挖掘 人工智能 基因 生物 基因表达 遗传学 转录组 社会科学 营销 社会学 业务 程序设计语言 大地测量学 地理
作者
Michelle Y. Y. Lee,Klaus H. Kaestner,Mingyao Li
标识
DOI:10.1101/2023.02.01.526609
摘要

Abstract Single-cell RNA-sequencing (scRNA-seq) measures gene expression in single cells, while single-nucleus ATAC-sequencing (snATAC-seq) enables the quantification of chromatin accessibility in single nuclei. These two data types provide complementary information for deciphering cell types/states. However, when analyzed individually, scRNA-seq and snATAC-seq data often produce conflicting results regarding cell type/state assignment. In addition, there is a loss of power as the two modalities reflect the same underlying cell types/states. Recently, it has become possible to measure both gene expression and chromatin accessibility from the same nucleus. Such paired data make it possible to directly model the relationships between the two modalities. However, given the availability of the vast amount of single-modality data, it is desirable to integrate the paired and unpaired single-modality data to gain a comprehensive view of the cellular complexity. Here, we benchmarked the performance of seven existing single-cell multi-omic data integration methods. Specifically, we evaluated whether these methods are able to uncover peak-gene associations from single-modality data, and to what extent the multiome data can provide additional guidance for the analysis of the existing single-modality data. Our results indicate that multiome data are helpful for annotating single-modality data, but the number of cells in the multiome data is critical to ensure a good cell type annotation. Additionally, when generating a multiome dataset, the number of cells is more important than sequencing depth for cell type annotation. Lastly, Seurat v4 is the best at integrating scRNA-seq, snATAC-seq, and multiome data even in the presence of complex batch effects.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
愉快的寻雪完成签到,获得积分10
刚刚
刚刚
池子恒完成签到,获得积分10
2秒前
山月发布了新的文献求助10
3秒前
3秒前
4秒前
上官若男应助wwwcf采纳,获得10
5秒前
5秒前
6秒前
深居简出发布了新的文献求助10
6秒前
流萤发布了新的文献求助10
6秒前
Hello应助调皮帆布鞋采纳,获得10
7秒前
赘婿应助兰彻采纳,获得10
8秒前
Whenhow发布了新的文献求助10
8秒前
阳光的语芙完成签到,获得积分10
10秒前
10秒前
times发布了新的文献求助10
10秒前
xiaoqiu完成签到,获得积分10
11秒前
晨曦完成签到,获得积分10
11秒前
molihuakai应助深居简出采纳,获得10
11秒前
啦啦啦发布了新的文献求助10
11秒前
蓝天发布了新的文献求助50
12秒前
狂野灵波发布了新的文献求助10
13秒前
lnz完成签到 ,获得积分10
13秒前
Akim应助777采纳,获得10
15秒前
15秒前
WYDNBDX2013发布了新的文献求助10
16秒前
16秒前
大模型应助all4sci采纳,获得10
17秒前
丘比特应助对映体采纳,获得10
19秒前
高兴的海豚完成签到,获得积分10
20秒前
rzzzy完成签到,获得积分10
20秒前
深情安青应助times采纳,获得10
21秒前
天天天才发布了新的文献求助10
22秒前
wwwcf发布了新的文献求助10
22秒前
22秒前
22秒前
周伯通发布了新的文献求助10
23秒前
jing完成签到,获得积分10
23秒前
无花果应助狂野灵波采纳,获得10
25秒前
高分求助中
Adhesion Science: Principles & Practice 1234
Signals, Systems, and Signal Processing 610
Introduction to Cosmetic Formulation and Technology, 2nd Edition 400
Petrology and Plate Tectonics,2025 400
Burger's Medicinal Chemistry and Drug Discovery 400
A Step-by-Step Guide to Qualitative Data Coding 2nd Edition 400
Programming for Chemical Engineers Using C, C++, and MATLAB 320
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6701555
求助须知:如何正确求助?哪些是违规求助? 8443258
关于积分的说明 18036152
捐赠科研通 5937483
什么是DOI,文献DOI怎么找? 2989141
邀请新用户注册赠送积分活动 1965023
关于科研通互助平台的介绍 1908708