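Computer science
Health care
Data science
Domain (mathematical analysis)
Strengths and weaknesses
Data sharing
Synthetic data
Metric (unit)
Data quality
Artificial intelligence
Medicine
Mathematical analysis
Philosophy
Pathology
Epistemology
Economics
Economic growth
Alternative medicine
Mathematics
Operations management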
Authors
Hajra Murtaza, Musharif Ahmed, Naurin Farooq Khan, Ghulam Murtaza, Saad Zafar, Ambreen Bano
Identifier
DOI:10.1016/j.cosrev.2023.100546
Abstract
Recent progress in artificial intelligence and machine learning has spurred research in every aspect of life, including the health care domain. However, privacy risks and legislation limit the availability of patient data to researchers. Synthetic data (SD) has been regarded as a privacy-safe alternative to real data and has lately been employed in many research and academic endeavors. This growing body of research needs to be consolidated so that researchers and practitioners can quickly gain a useful understanding of the state of the art in synthetic data generation in health care. The purpose of this study is to collate and synthesize the current state of synthetic data generation through a narrative review of 70 peer-reviewed studies discussing privacy-preserving synthetic medical data generation techniques. The literature shows the effectiveness of synthetic datasets for different applications in research, academia, and testing according to existing statistical and task-based utility metrics. However, longitudinal synthetic data appears to have received insufficient attention. Moreover, a unified metric for generic quality assessment of synthetic data is lacking. The results of this review will serve as a quick reference guide for researchers and practitioners in the health care domain to select a suitable synthetic data strategy for their application based on its strengths and weaknesses, and pave the way for further research and development in health care.