发布文献求助

Comparison of Performance of Data Imputation Methods for Numeric Dataset

插补（统计学）缺少数据计算机科学数据挖掘回归贝叶斯概率均方误差线性回归统计人工智能机器学习数学

作者

Anil Jadhav,Dhanya Pramod,Krishnan Ramanathan

出处

期刊：Applied Artificial Intelligence [Taylor & Francis]
日期：2019-07-04 卷期号：33 (10): 913-933 被引量：443

链接

doaj.orgdoi.org

标识

DOI：10.1080/08839514.2019.1637138

摘要

Missing data is common problem faced by researchers and data scientists. Therefore, it is required to handle them appropriately in order to get better and accurate results of data analysis. Objective of this research paper is to provide better understanding of data missingness mechanism, data imputation methods, and to assess performance of the widely used data imputation methods for numeric dataset. It will help practitioners and data scientists to select appropriate method of data imputation for numeric dataset while performing data mining task. In this paper, we comprehensively compare seven data imputation methods namely mean imputation, median imputation, kNN imputation, predictive mean matching, Bayesian Linear Regression (norm), Linear Regression, non-Bayesian (norm.nob), and random sample. We have used five different numeric datasets obtained from UCI machine learning repository for analyzing and comparing performance of the data imputation methods. Performance of the data imputation methods is assessed using Normalized Root Mean Square Error (RMSE) method. The results of analysis show that kNN imputation method outperforms the other methods. It has also been found that performance of the data imputation method is independent of the dataset and percentage of missing values in the dataset.

求助该文献

最长约 10秒，即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI

我的文献求助列表浏览历史

一分钟了解求助规则 | 捐赠本站 | 历史今天

更新

⚡ 2026年影响因子、分区 已更新！ (2026-6-17)

更新

📰 新增『新锐期刊分区』 (2026-3-24)

更新

💬 新增更精细的自定义提醒设置 (2026-1-4)

新增

🕒 每天60秒读懂世界·精选全球要闻 (2026-1-2)

新增

PDF的下载单位、IP信息已删除 (2025-6-4)

科研通是完全免费的文献互助平台，具备全网最快的应助速度，最高的求助完成率。对每一个文献求助，科研通都将尽心尽力，给求助人一个满意的交代。

实时播报: 朗朗书生完成签到，获得积分10

刚刚; wanci上传了应助文件

刚刚; 柒琦完成签到，获得积分10

刚刚; Owen上传了应助文件

刚刚; 尊敬的斑马发布了新的文献求助10

刚刚; 火星上雅寒完成签到，获得积分10

刚刚; 打打的应助被拾时采纳，获得10

刚刚; 科研通AI6.4的应助被友好奇异果采纳，获得10

1秒前; yuri完成签到，获得积分10

1秒前; 12346发布了新的文献求助10

1秒前; xixosn完成签到，获得积分10

1秒前; 抓住努力的尾巴完成签到，获得积分10

1秒前; Zert完成签到，获得积分10

2秒前; 充电宝上传了应助文件

2秒前; 小透明举报丁丁的求助涉嫌违规

3秒前; 随便吧发布了新的文献求助10

3秒前; 小白关注了科研通微信公众号

3秒前; ygsts完成签到，获得积分10

3秒前; 兴奋的听云完成签到，获得积分10

4秒前; 科研通AI6.2上传了应助文件

4秒前; 染染完成签到，获得积分10

4秒前; 共享精神的应助被怪脾气采纳，获得10

4秒前; 深情安青上传了应助文件

4秒前; 桉_发布了新的文献求助10

5秒前; 紫菜完成签到，获得积分10

5秒前; wuwuwuwu发布了新的文献求助10

5秒前; 科研通AI6.4上传了应助文件

6秒前; 蔡宇滔完成签到，获得积分10

6秒前; xixi发布了新的文献求助10

6秒前; hexy629完成签到，获得积分10

7秒前; 儒雅谷芹完成签到，获得积分10

7秒前; 丘比特上传了应助文件

7秒前; 迅速手套完成签到，获得积分10

7秒前; 今后的应助被追寻又柔采纳，获得10

7秒前; xian发布了新的文献求助30

8秒前; 科研通AI6.4上传了应助文件

8秒前; 顾矜的应助被lcy666llll采纳，获得10

9秒前; 科研通AI6.4上传了应助文件

9秒前; 林狗发布了新的文献求助10

9秒前; 852的应助被111采纳，获得10

9秒前

高分求助中: Cronologia da história de Macau 5000; Merrill's Atlas of Radiographic Positioning and Procedures - 3-Volume Set, 16th Edition 2000; Interactions of Vowel Quality and Prosody in East Slavic 500; Vander's Renal Physiology第10版 500; CLSI M27M44S Performance Standards for Antifungal Susceptibility Testing of Yeasts Fourth Edition 400; Python for Chemists 400; Analytical Separation Science 400

热门求助领域（近24小时）

热门帖子: 关注科研通微信公众号，转发送积分 7112645; 求助须知：如何正确求助？哪些是违规求助？ 8765979; 关于积分的说明 18537552; 捐赠科研通 6681520; 什么是DOI，文献DOI怎么找？ 3144720; 关于科研通互助平台的介绍 2260482; 邀请新用户注册赠送积分活动 2119306

今日热心研友

学术文献互助

热心市民小红花

自觉的万言

注：热心度 = 本日应助数 + 本日被采纳获取积分÷10

Copyright © 2020-2026 AbleSci.COM, 科研通, All Right Reserved

科研通是非营利科研互助平台，不忘初心，为科研助力

本站互助的所有文件仅供个人学习研究用，禁止任何人把求助的所得文献进行盈利或传播

皖ICP备2024041134号-1

皖公网安备34019202002308

科研通【文献互助QQ群】：如果您有特殊求助，或发布求助超过24小时未得到应助，可加群求助，群号：821889395【点击一键加群】

科研通【志愿服务QQ群】：如果您热爱文献互助，有热心愿意为更多人服务，请加入小伙伴群，点击申请加入

关注微信服务号

科研通