组学
计算机科学
嵌入
推论
可扩展性
数据集成
标杆管理
图形
计算生物学
系统生物学
模块化设计
数据挖掘
生物信息学
人工智能
生物
理论计算机科学
数据库
操作系统
业务
营销
标识
DOI:10.1038/s41587-022-01284-4
摘要
Despite the emergence of experimental methods for simultaneous measurement of multiple omics modalities in single cells, most single-cell datasets include only one modality. A major obstacle in integrating omics data from multiple modalities is that different omics layers typically have distinct feature spaces. Here, we propose a computational framework called GLUE (graph-linked unified embedding), which bridges the gap by modeling regulatory interactions across omics layers explicitly. Systematic benchmarking demonstrated that GLUE is more accurate, robust and scalable than state-of-the-art tools for heterogeneous single-cell multi-omics data. We applied GLUE to various challenging tasks, including triple-omics integration, integrative regulatory inference and multi-omics human cell atlas construction over millions of cells, where GLUE was able to correct previous annotations. GLUE features a modular design that can be flexibly extended and enhanced for new analysis tasks. The full package is available online at https://github.com/gao-lab/GLUE .
科研通智能强力驱动
Strongly Powered by AbleSci AI