缺少数据
插补(统计学)
计算机科学
加权
集成学习
可学性
数据挖掘
可用性
人工智能
集合预报
机器学习
医学
人机交互
放射科
作者
Min Wang,Binqian Li,Fan Min,Jiaxue Liu,Manlong Wang
标识
DOI:10.1109/icnsc48988.2020.9238068
摘要
Real data is often incomplete, which hinders its usability and learnability. A reasonable machine learning scenario is to obtain some values and labels at cost upon request. In this paper, we propose a new ensemble active missing imputation (EAMI) algorithm to handle the learning task. First, we design five missing imputation methods, including mean filling, cubic spline interpolation filling, sample-based collaborative filtering weighed filling, attribute-based collaborative filtering weighted filling and k-nearest neighbor (KNN) filling. Second, we propose an ensemble imputation model through the linear weighting of attribute prediction values. Third, We propose a three-way decisions model that uses the variance of the predicted values to fill in missing values by querying true label or using predicted values. We conduct experiments on University of California Irvine(UCI) datasets. The results of significance test verify the effectiveness of EAMI and its superiority over KNN missing data imputation algorithms.
科研通智能强力驱动
Strongly Powered by AbleSci AI