计算生物学
仿形(计算机编程)
基因表达谱
药物发现
生物
基因表达
计算机科学
基因
数据挖掘
生物信息学
遗传学
操作系统
作者
Marzieh Haghighi,Juan C. Caicedo,Beth A. Cimini,Anne E. Carpenter,Shantanu Singh
出处
期刊:Nature Methods
[Springer Nature]
日期:2022-11-07
卷期号:19 (12): 1550-1557
被引量:69
标识
DOI:10.1038/s41592-022-01667-0
摘要
Cells can be perturbed by various chemical and genetic treatments and the impact on gene expression and morphology can be measured via transcriptomic profiling and image-based assays, respectively. The patterns observed in these high-dimensional profile data can power a dozen applications in drug discovery and basic biology research, but both types of profiles are rarely available for large-scale experiments. Here, we provide a collection of four datasets with both gene expression and morphological profile data useful for developing and testing multimodal methodologies. Roughly a thousand features are measured for each of the two data types, across more than 28,000 chemical and genetic perturbations. We define biological problems that use the shared and complementary information in these two data modalities, provide baseline analysis and evaluation metrics for multi-omic applications, and make the data resource publicly available ( https://broad.io/rosetta/ ). This Resource presents and analyzes four datasets containing both gene expression and morphological profile data for cells subjected to hundreds to thousands of chemical or genetic perturbations and highlights their complementary nature.
科研通智能强力驱动
Strongly Powered by AbleSci AI