缺少数据
支持向量机
计算机科学
数据挖掘
回归
数据集
集合(抽象数据类型)
价值(数学)
回归分析
人工智能
模式识别(心理学)
统计
机器学习
数学
程序设计语言
作者
Feng Honghai,Chen Guoshun,Yin Cheng,Yang Bing-ru,Chen Yumei
摘要
In KDD procedure, to fill in missing data typically requires a very large investment of time and energy – often 80% to 90% of a data analysis project is spent in making the data reliable enough so that the results can be trustful. In this paper, we propose a SVM regression based algorithm for filling in missing data, i.e. set the decision attribute (output attribute) as the condition attribute (input attribute) and the condition attribute as the decision attribute, then use SVM regression to predict the condition attribute values. SARS data set experimental results show that SVM regression method has the highest precision. The method with which the value of the example that has the minimum distance to the example with missing value will be taken to fill in the missing values takes the second place, and the mean and median methods have lower precision.
科研通智能强力驱动
Strongly Powered by AbleSci AI