计算机科学
数据科学
计算生物学
高维
数据挖掘
生物信息学
人工智能
生物
作者
Robert Clarke,Habtom W. Ressom,Antai Wang,Jianhua Xuan,Minetta C. Liu,Edmund A. Gehan,Yue Wang
出处
期刊:Nature Reviews Cancer
[Springer Nature]
日期:2007-12-21
卷期号:8 (1): 37-49
被引量:541
摘要
High-dimensional genomic and proteomic data are now commonplace in cancer research. This Review aims to help biologists understand the properties of high-dimensional data spaces and how these affect our ability to derive meaningful information from the data. High-throughput genomic and proteomic technologies are widely used in cancer research to build better predictive models of diagnosis, prognosis and therapy, to identify and characterize key signalling networks and to find new targets for drug development. These technologies present investigators with the task of extracting meaningful statistical and biological information from high-dimensional data spaces, wherein each sample is defined by hundreds or thousands of measurements, usually concurrently obtained. The properties of high dimensionality are often poorly understood or overlooked in data modelling and analysis. From the perspective of translational science, this Review discusses the properties of high-dimensional data spaces that arise in genomic and proteomic studies and the challenges they can pose for data analysis and interpretation.
科研通智能强力驱动
Strongly Powered by AbleSci AI