亲爱的研友该休息了!由于当前在线用户较少,发布求助请尽量完整的填写文献信息,科研通机器人24小时在线,伴您度过漫漫科研夜!身体可是革命的本钱,早点休息,好梦!

Machine learning-based imputation soft computing approach for large missing scale and non-reference data imputation

缺少数据 插补(统计学) 计算机科学 数据挖掘 机器学习
作者
A. H. Alamoodi,B. B. Zaidan,A. A. Zaidan,O. S. Albahri,Juliana Chen,M. A. Chyad,Salem Garfan,Ahmed Marwan Aleesa
出处
期刊:Chaos Solitons & Fractals [Elsevier]
卷期号:151: 111236-111236 被引量:31
标识
DOI:10.1016/j.chaos.2021.111236
摘要

Missing data is a common problem in real-world data sets and it is amongst the most complex topics in computer science and many other research domains. The common ways to cope with missing values are either by elimination or imputation depending of the volume of the missing data and its distribution nature. It becomes imperative to come up with new imputation approaches along with efficient algorithms. Though most existing imputation methods focus on a moderate amount of missing data, imputation for high missing rates over 80% is still important but challenging. Even with the existence of some works in addressing high missing volume issue, they mostly rely on imputing reference dataset (Complete Datasets for evaluation) after they create artificial missing values and impute it to measure the accuracy of their proposed techniques. So far, the option of imputing high proportions of missing values with no reference comparison dataset (Original Dataset with highly missing values) have been often ignored or overlooked. Therefore, we propose a missing data imputation approach for high volumes of missing values with no reference comparison dataset. The approach makes use of pre-processing measures and breaking the dataset into small continuous non-missing portions then using Multi Criteria Decision-making analysis to select a portion of data which is representative of the entire broken datasets. This portion helps to create reference comparisons and expands the missing dataset through artificial missing-making procedures with different percentages and imputation using different machine learning techniques. This study conducted two experiments using BMI datasets with more than 80% of missing values, derived from the National Child Development Centre (NCDRC) at Sultan Idris Education University (UPSI), Malaysia. The results show that our approach capability in reconstructing datasets with huge missing values.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
andy发布了新的文献求助10
3秒前
hwen1998完成签到 ,获得积分10
13秒前
科研通AI2S应助科研通管家采纳,获得10
16秒前
所所应助科研通管家采纳,获得10
16秒前
18秒前
taotao发布了新的文献求助10
25秒前
韩保晨完成签到 ,获得积分10
27秒前
taotao完成签到,获得积分10
33秒前
YY完成签到,获得积分10
41秒前
Dejanice完成签到,获得积分10
57秒前
端庄的猕猴桃完成签到 ,获得积分10
58秒前
自由的梦露完成签到 ,获得积分10
1分钟前
1分钟前
1分钟前
水若冰寒完成签到,获得积分10
1分钟前
Swiftie完成签到 ,获得积分10
1分钟前
梦_筱彩完成签到 ,获得积分10
1分钟前
好想吃大餐完成签到,获得积分10
1分钟前
ty完成签到,获得积分10
1分钟前
2分钟前
深情安青应助ty采纳,获得10
2分钟前
qqqqgc发布了新的文献求助10
2分钟前
彭于晏应助林攸之采纳,获得10
2分钟前
年轻的凤完成签到 ,获得积分10
2分钟前
lingshan完成签到 ,获得积分10
2分钟前
2分钟前
研友_VZG7GZ应助科研通管家采纳,获得50
2分钟前
英姑应助科研通管家采纳,获得10
2分钟前
Jasper应助科研通管家采纳,获得10
2分钟前
2分钟前
orixero应助科研通管家采纳,获得10
2分钟前
科研通AI2S应助郁乾采纳,获得10
2分钟前
充电宝应助健康的修洁采纳,获得10
2分钟前
完美世界应助LYUT嘎嘎采纳,获得10
2分钟前
CodeCraft应助pluto采纳,获得10
2分钟前
无限的石头完成签到 ,获得积分10
2分钟前
fendy完成签到,获得积分10
2分钟前
受伤幻桃完成签到 ,获得积分10
2分钟前
2分钟前
pluto发布了新的文献求助10
2分钟前
高分求助中
求助这个网站里的问题集 1000
Floxuridine; Third Edition 1000
Models of Teaching(The 10th Edition,第10版!)《教学模式》(第10版!) 800
La décision juridictionnelle 800
Rechtsphilosophie und Rechtstheorie 800
Nonlocal Integral Equation Continuum Models: Nonstandard Symmetric Interaction Neighborhoods and Finite Element Discretizations 500
Academic entitlement: Adapting the equity preference questionnaire for a university setting 500
热门求助领域 (近24小时)
化学 医学 材料科学 生物 工程类 有机化学 生物化学 物理 内科学 纳米技术 计算机科学 化学工程 复合材料 基因 遗传学 物理化学 催化作用 免疫学 细胞生物学 电极
热门帖子
关注 科研通微信公众号,转发送积分 2871913
求助须知:如何正确求助?哪些是违规求助? 2479922
关于积分的说明 6720156
捐赠科研通 2166371
什么是DOI,文献DOI怎么找? 1151059
版权声明 585660
科研通“疑难数据库(出版商)”最低求助积分说明 565044