发布文献求助

Automatic Noise Generation and Reduction for Text Classification

众包噪音（视频）计算机科学降噪人工智能机器学习还原（数学）噪声测量度量（数据仓库）模式识别（心理学）自然语言处理数据挖掘数学几何学图像（数学）万维网

作者

Huiyao Chen,Yueheng Sun,Meishan Zhang,Min Zhang

出处

期刊：IEEE/ACM transactions on audio, speech, and language processing [Institute of Electrical and Electronics Engineers]
日期：2023-10-16 卷期号：32: 139-150 被引量：1

标识

DOI：10.1109/taslp.2023.3325135

摘要

Label noise is an important issue in machine learning, which might lead to negative influences on various tasks. Given that real benchmarks for evaluation of noise reduction methods are limited, plenty of studies construct pseudo noisy data to verify their proposed methods. However, very few works have realized the rationality of the noise generation strategies. If the generated pseudo datasets are biased, their final conclusions might also be problematic. In this work, we focus on text classification of natural language processing (NLP) to investigate various pseudo noise generation methods, which is the first work of this line for NLP. In particular, we compare the noise generated with crowdsourcing noise, a kind of real noise as gold-standard, to evaluate these noise generation methods. After then, we measure and compare the performance of representative noise reduction methods respectively based on the data of crowdsourcing and our top-ranked pseudo noisy generation strategies. We conduct experiments on five text classification datasets, offering detailed comparison results as well as discussions.

求助该文献

最长约 10秒，即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI

我的文献求助列表浏览历史

一分钟了解求助规则 | 捐赠本站 | 历史今天

更新

2025年影响因子查询已上线 (2025-6-18)

更新

PDF的下载单位、IP信息已删除 (2025-6-4)

科研通是完全免费的文献互助平台，具备全网最快的应助速度，最高的求助完成率。对每一个文献求助，科研通都将尽心尽力，给求助人一个满意的交代。

实时播报: CodeCraft的应助被甜蜜屁池采纳，获得10

刚刚; cgl155410完成签到，获得积分10

1秒前; 宇文宛菡完成签到，获得积分10

1秒前; xzyin上传了应助文件

1秒前; 李爱国的应助被康康采纳，获得10

1秒前; 李健的小迷弟上传了应助文件

1秒前; 大个的应助被想好好搞事业采纳，获得10

5秒前; Echo完成签到，获得积分10

5秒前; 思源上传了应助文件

6秒前; zyj发布了新的文献求助10

6秒前; 超级的一斩完成签到，获得积分10

7秒前; lin发布了新的文献求助10

7秒前; 还单身的香之完成签到，获得积分10

7秒前; 852的应助被冰阔落采纳，获得10

8秒前; zink完成签到，获得积分10

8秒前; 852上传了应助文件

9秒前; mol完成签到，获得积分10

9秒前; 大卫戴完成签到，获得积分10

9秒前; 量子星尘发布了新的文献求助10

10秒前; 学术废物发布了新的文献求助10

10秒前; 科研通AI2S上传了应助文件

11秒前; 高兴的灰狼完成签到，获得积分10

11秒前; Elon完成签到，获得积分10

15秒前; fcyyc完成签到，获得积分20

15秒前; 还单身的香之发布了新的文献求助10

16秒前; 大模型的应助被zhuhaot采纳，获得50

16秒前; 研友_LMo56Z关闭了研友_LMo56Z的文献求助

17秒前; wu8577的应助被Cakeat采纳，获得10

17秒前; 小二郎的应助被ccm采纳，获得20

19秒前; 今后的应助被洺全采纳，获得10

19秒前; 拼搏诗翠完成签到，获得积分10

20秒前; Jasper的应助被实验顺顺利利采纳，获得10

21秒前; IyGnauH发布了新的文献求助10

21秒前; 昏睡的蟠桃的应助被emmm采纳，获得100

23秒前; 酷波er上传了应助文件

23秒前; 小马甲上传了应助文件

23秒前; SYLH的应助被PROTAC采纳，获得10

26秒前; 麦穗完成签到，获得积分10

26秒前; 领导范儿的应助被flasher22采纳，获得10

27秒前; 眼睛大雨筠上传了应助文件

27秒前

高分求助中: The Mother of All Tableaux Order, Equivalence, and Geometry in the Large-scale Structure of Optimality Theory 2400; Ophthalmic Equipment Market by Devices(surgical: vitreorentinal,IOLs,OVDs,contact lens,RGP lens,backflush,diagnostic&monitoring:OCT,actorefractor,keratometer,tonometer,ophthalmoscpe,OVD), End User,Buying Criteria-Global Forecast to2029 2000; Cognitive Neuroscience: The Biology of the Mind (Sixth Edition) 1000; Optimal Transport: A Comprehensive Introduction to Modeling, Analysis, Simulation, Applications 800; Official Methods of Analysis of AOAC INTERNATIONAL 600; ACSM’s Guidelines for Exercise Testing and Prescription, 12th edition 588; A Preliminary Study on Correlation Between Independent Components of Facial Thermal Images and Subjective Assessment of Chronic Stress 500

热门求助领域（近24小时）

热门帖子: 关注科研通微信公众号，转发送积分 3958130; 求助须知：如何正确求助？哪些是违规求助？ 3504312; 关于积分的说明 11117892; 捐赠科研通 3235623; 什么是DOI，文献DOI怎么找？ 1788403; 邀请新用户注册赠送积分活动 871211; 科研通“疑难数据库（出版商）”最低求助积分说明 802547

今日热心研友

热心市民小红花

昏睡的蟠桃

眼睛大雨筠

眯眯眼的衬衫

注：热心度 = 本日应助数 + 本日被采纳获取积分÷10

Copyright © 2020-2025 AbleSci.COM, 科研通, All Right Reserved

科研通是非营利科研互助平台，不忘初心，为科研助力

本站互助的所有文件仅供个人学习研究用，禁止任何人把求助的所得文献进行盈利或传播

皖ICP备2024041134号-1

皖公网安备34019202002308

科研通【文献互助QQ群】：如果您有特殊求助，或发布求助超过24小时未得到应助，可加群求助，群号：941272744【点击一键加群】

科研通【志愿服务QQ群】：如果您热爱文献互助，有热心愿意为更多人服务，请加入小伙伴群，点击申请加入

关注微信服务号

科研通