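Ensembl
Bioconductor
Computer science
Identifier
Computational biology
R package
Genomics
Data mining
Data integration
Genome
Data science
Biology
Gene
Genetics
Programming language
Authors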
Steffen Durinck,Paul T. Spellman,Ewan Birney,Wolfgang Huber
Source
Journal: Nature Protocols
[Springer Nature]
Date: 2009-07-23
Volume (issue): 4 (8): 1184-1191
Citations: 2871
Identifier
DOI:10.1038/nprot.2009.97
Abstract
Genomic experiments produce multiple views of biological systems, among them are DNA sequence and copy number variation, and mRNA and protein abundance. Understanding these systems needs integrated bioinformatic analysis. Public databases such as Ensembl provide relationships and mappings between the relevant sets of probe and target molecules. However, the relationships can be biologically complex and the content of the databases is dynamic. We demonstrate how to use the computational environment R to integrate and jointly analyze experimental datasets, employing BioMart web services to provide the molecule mappings. We also discuss typical problems that are encountered in making gene-to-transcript-to-protein mappings. The approach provides a flexible, programmable and reproducible basis for state-of-the-art bioinformatic data integration.