组学
数据集成
计算机科学
表观遗传学
代谢组学
蛋白质组学
基因组学
数据科学
计算生物学
机器学习
生物信息学
数据挖掘
生物
基因组
基因
基因表达
DNA甲基化
生物化学
作者
Milan Picard,Marie‐Pier Scott‐Boyer,Antoine Bodein,Olivier Périn,Arnaud Droit
标识
DOI:10.1016/j.csbj.2021.06.030
摘要
Increased availability of high-throughput technologies has generated an ever-growing number of omics data that seek to portray many different but complementary biological layers including genomics, epigenomics, transcriptomics, proteomics, and metabolomics. New insight from these data have been obtained by machine learning algorithms that have produced diagnostic and classification biomarkers. Most biomarkers obtained to date however only include one omic measurement at a time and thus do not take full advantage of recent multi-omics experiments that now capture the entire complexity of biological systems. Multi-omics data integration strategies are needed to combine the complementary knowledge brought by each omics layer. We have summarized the most recent data integration methods/ frameworks into five different integration strategies: early, mixed, intermediate, late and hierarchical. In this mini-review, we focus on challenges and existing multi-omics integration strategies by paying special attention to machine learning applications.
科研通智能强力驱动
Strongly Powered by AbleSci AI