A label noise filtering method for regression based on adaptive threshold and noise score

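Keywords: Noise (video), Computer science, Artificial intelligence, Filter (signal processing), Noise measurement, Machine learning, Hyperparameter, Pattern recognition (psychology), Set (abstract data type), Data mining, Noise reduction, Computer vision, Image (mathematics), Programming language
Authors
Chuang Li, Zhizhong Mao
Source
Journal: Expert Systems With Applications [Elsevier BV]
Volume: 228, 120422-120422  Cited by: 9
Identifier
DOI: 10.1016/j.eswa.2023.120422
Abstract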

The quality of training data plays a decisive role in the establishment of intelligent models. Since raw data obtained from the real world are usually entwined with noise due to a variety of causes, noise filtering has become an important aspect of machine learning techniques. In contrast with the extensive research conducted on noise elimination for classification purposes, papers addressing this problem for regression tasks are rather scarce. In this paper, we propose a novel noise filter to clean noisy instances with real-valued label noise. Aiming at the deficiency of the existing noise determination criterion, a new adaptive threshold-based method is first proposed. It allows a noisy instance to be adaptively defined according to the fitting difficulty levels of different datasets, and areas with different densities. Embedded with this criterion, an effective noise filtering procedure is also designed. An ensemble filtering scheme and an iterative filtering process are combined to detect as many potential noisy samples as possible from the original training set. According to the acquired noise detection information, a noise score for evaluating the noise level is specifically developed. The potential noisy samples whose scores exceed a reasonable threshold are further filtered, which can compensate for the possible errors incurred during the previous procedure, and contribute to more reliable filtering results. The validity of the proposed method is studied in exhaustive experiments. We discuss reasonable hyperparameters, and compare the developed method with several state-of-the-art noise filters. The outcomes show that the prediction accuracy of the utilized regressor can greatly benefit from preprocessing the given raw dataset by using our method. Simultaneously, the method is able to acquire a good balance between the elimination of noisy samples and the retention of clean samples, and consistently achieves a better noise filtering performance.
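
The abstract names the ingredients of the filter (an adaptive threshold, an ensemble of filters, an iterative process, and a final noise score) but gives no formulas. The sketch below is not the authors' algorithm; it is a minimal illustration of how such ingredients could fit together, under assumptions of our own. The ensemble members are leave-one-out k-NN regressors with different k, the threshold is `alpha` times the sum of the median absolute residual (a stand-in for dataset-level fitting difficulty) and the standard deviation of the neighbours' labels (a stand-in for local behaviour), and the score is the fraction of checks in which a sample was flagged. The names `loo_knn`, `noise_scores`, `ks`, `alpha`, `n_iter`, and the 0.5 cutoff are all hypothetical.

```python
import numpy as np

def loo_knn(X, y, k):
    """Leave-one-out k-NN prediction and neighbour-label spread per sample."""
    dist = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=2)
    np.fill_diagonal(dist, np.inf)           # a sample is never its own neighbour
    nbrs = np.argsort(dist, axis=1)[:, :k]   # indices of the k nearest neighbours
    return y[nbrs].mean(axis=1), y[nbrs].std(axis=1)

def noise_scores(X, y, ks=(3, 5, 7), alpha=3.0, n_iter=3):
    """Fraction of ensemble checks in which each sample was flagged as noisy."""
    n = len(y)
    active = np.ones(n, dtype=bool)   # samples still in the working training set
    flags = np.zeros(n)               # how often each sample was flagged
    checks = np.zeros(n)              # how often each sample was examined
    for _ in range(n_iter):
        idx = np.flatnonzero(active)
        round_flags = np.zeros(len(idx))
        for k in ks:                  # one ensemble member per value of k
            pred, spread = loo_knn(X[idx], y[idx], k)
            resid = np.abs(y[idx] - pred)
            # Assumed adaptive threshold: global difficulty plus local spread.
            thresh = alpha * (np.median(resid) + spread)
            round_flags += resid > thresh
        flags[idx] += round_flags
        checks[idx] += len(ks)
        # Majority-flagged samples leave the working set before the next round.
        suspects = idx[round_flags > len(ks) / 2]
        if suspects.size == 0:
            break
        active[suspects] = False
    return flags / checks

# Toy data: a sine curve with small noise and ten large injected label errors.
rng = np.random.default_rng(0)
X = np.linspace(0, 6, 200).reshape(-1, 1)
y = np.sin(X[:, 0]) + 0.01 * rng.normal(size=200)
noisy_idx = np.arange(10, 200, 20)
y[noisy_idx] += 3.0

scores = noise_scores(X, y)
keep = scores <= 0.5                  # assumed cutoff on the noise score
```

On this toy data the injected samples are isolated from each other, which is what lets the neighbour-based threshold separate them; clustered label errors would need a more careful criterion than this sketch provides.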