药物发现
生物标志物发现
知识抽取
管道(软件)
数据科学
背景(考古学)
生物医学文本挖掘
生物学数据
计算生物学
数据集成
计算机科学
K-最优模式发现
药学
数据挖掘
翻译生物信息学
生物信息学
文本挖掘
基因组学
蛋白质组学
生物
基因
基因组
古生物学
生物化学
程序设计语言
作者
Yongliang Yang,S. James Adelstein,Amin I. Kassis
标识
DOI:10.1016/j.drudis.2011.12.006
摘要
Data mining of available biomedical data and information has greatly boosted target discovery in the 'omics' era. Target discovery is the key step in the biomarker and drug discovery pipeline to diagnose and fight human diseases. In biomedical science, the 'target' is a broad concept ranging from molecular entities (such as genes, proteins and miRNAs) to biological phenomena (such as molecular functions, pathways and phenotypes). Within the context of biomedical science, data mining refers to a bioinformatics approach that combines biological concepts with computer tools or statistical methods that are mainly used to discover, select and prioritize targets. In response to the huge demand of data mining for target discovery in the 'omics' era, this review explicates various data mining approaches and their applications to target discovery with emphasis on text and microarray data analysis. Two emerging data mining approaches, chemogenomic data mining and proteomic data mining, are briefly introduced. Also discussed are the limitations of various data mining approaches found in the level of database integration, the quality of data annotation, sample heterogeneity and the performance of analytical and mining tools. Tentative strategies of integrating different data sources for target discovery, such as integrated text mining with high-throughput data analysis and integrated mining with pathway databases, are introduced.
科研通智能强力驱动
Strongly Powered by AbleSci AI