计算机科学
缺少数据
插补(统计学)
一致性(知识库)
机器学习
数据挖掘
生成语法
人工智能
生成对抗网络
健康档案
数据一致性
深度学习
医疗保健
经济
经济增长
操作系统
作者
Xutao Weng,Hong Song,Yucong Lin,You Wu,Xi Zhang,Bowen Liu,Jian Yang
标识
DOI:10.1016/j.compbiomed.2023.107687
摘要
Electronic health records (EHR), present challenges of incomplete and imbalanced data in clinical predictions. Previous studies addressed these two issues with two-step separately, which caused the decrease in the performance of prediction tasks. In this paper, we propose a unified framework to simultaneously addresses the challenges of incomplete and imbalanced data in EHR. Based on the framework, we develop a model called Missing Value Imputation and Imbalanced Learning Generative Adversarial Network (MVIIL-GAN). We use MVIIL-GAN to perform joint learning on the imputation process of high missing rate data and the conditional generation process of EHR data. The joint learning is achieved by introducing two discriminators to distinguish the fake data from the generated data at sample-level and variable-level. MVIIL-GAN integrate the missing values imputation and data generation in one step, improving the consistency of parameter optimization and the performance of prediction tasks. We evaluate our framework using the public dataset MIMIC-IV with high missing rates data and imbalanced data. Experimental results show that MVIIL-GAN outperforms existing methods in prediction performance. The implementation of MVIIL-GAN can be found at https://github.com/Peroxidess/MVIIL-GAN.
科研通智能强力驱动
Strongly Powered by AbleSci AI