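Computer science
Biobank
Data science
Precision medicine
Field (mathematics)
Variety (cybernetics)
Generative grammar
Machine learning
Generative model
Representation (politics)
Synthetic data
Artificial intelligence
Data mining
Medicine
Bioinformatics
Mathematics
Pathology
Politics
Political science
Pure mathematics
Law
Biology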
Authors
Kara Liu, Russ B. Altman
Source
Journal: Annual Review of Biomedical Data Science
[Annual Reviews]
Date: 2025-01-14
Identifier
DOI: 10.1146/annurev-biodatasci-103123-094844
Abstract
Tabular medical datasets, like electronic health records (EHRs), biobanks, and structured clinical trial data, are rich sources of information with the potential to advance precision medicine and optimize patient care. However, real-world medical datasets have limited patient diversity and cannot simulate hypothetical outcomes, both of which are necessary for equitable and effective medical research. Fueled by recent advancements in machine learning, generative models offer a promising solution to these data limitations by generating enhanced synthetic data. This review highlights the potential of conditional generative models (CGMs) to create patient-specific synthetic data for a variety of precision medicine applications. We survey CGM approaches that tackle two medical applications: correcting for data representation biases and simulating digital health twins. We additionally explore how the surveyed methods handle modeling tabular medical data and briefly discuss evaluation criteria. Finally, we summarize the technical, medical, and ethical challenges that must be addressed before CGMs can be effectively and safely deployed in the medical field.