表观基因组
DNA甲基化
生物
生物标志物
疾病
计算生物学
生物信息学
甲基化
计算机科学
遗传学
DNA
医学
基因
内科学
基因表达
作者
Paul Yousefi,Matthew Suderman,Ryan Langdon,Oliver Whitehurst,George Davey Smith,Caroline Relton
标识
DOI:10.1038/s41576-022-00465-w
摘要
DNA methylation data have become a valuable source of information for biomarker development, because, unlike static genetic risk estimates, DNA methylation varies dynamically in relation to diverse exogenous and endogenous factors, including environmental risk factors and complex disease pathology. Reliable methods for genome-wide measurement at scale have led to the proliferation of epigenome-wide association studies and subsequently to the development of DNA methylation-based predictors across a wide range of health-related applications, from the identification of risk factors or exposures, such as age and smoking, to early detection of disease or progression in cancer, cardiovascular and neurological disease. This Review evaluates the progress of existing DNA methylation-based predictors, including the contribution of machine learning techniques, and assesses the uptake of key statistical best practices needed to ensure their reliable performance, such as data-driven feature selection, elimination of data leakage in performance estimates and use of generalizable, adequately powered training samples.
科研通智能强力驱动
Strongly Powered by AbleSci AI