插补(统计学)
缺少数据
计算机科学
人工智能
降噪
数据挖掘
模式识别(心理学)
机器学习
作者
Lovedeep Gondara,Ke Wang
标识
DOI:10.1007/978-3-319-93040-4_21
摘要
Missing data is a significant problem impacting all domains. State-of-the-art framework for minimizing missing data bias is multiple imputation, for which the choice of an imputation model remains nontrivial. We propose a multiple imputation model based on overcomplete deep denoising autoencoders. Our proposed model is capable of handling different data types, missingness patterns, missingness proportions and distributions. Evaluation on several real life datasets show our proposed model significantly outperforms current state-of-the-art methods under varying conditions while simultaneously improving end of the line analytics.
科研通智能强力驱动
Strongly Powered by AbleSci AI