Dual adversarial network with meta-learning for domain-generalized few-shot text classification

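Keywords: Discriminator; Computer science; Artificial intelligence; Adversarial system; Meta-learning (computer science); Machine learning; Domain (mathematical analysis); Leverage (statistics); Dual (grammatical number); Generalization; Generator (circuit theory); Pattern recognition (psychology); Task (project management); Mathematics; Power (physics); Art; Telecommunications; Mathematical analysis; Management; Literature; Quantum mechanics; Economics; Physics; Detector
Authors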
Xuyang Wang,Yajun Du,Danroujing Chen,Xianyong Li,Xiaoliang Chen,Yongquan Fan,Chunzhi Xie,Yanli Li,Jia Liu,Hui Li
Source
Journal: Applied Soft Computing [Elsevier]
Volume/issue: 146: 110697-110697 Cited by: 1
Identifier
DOI:10.1016/j.asoc.2023.110697
Abstract

Meta-learning-based methods prevail in few-shot text classification. Current methods perform meta-training and meta-testing on two parts of a dataset from the same or similar domains. As a result, model performance drops markedly on data from different domains, which limits the generalization of few-shot models. To solve this problem, this study proposes a new setting, namely, domain-generalized few-shot text classification: meta-training is first conducted on a multi-domain dataset to learn a generalizable model, and the model is then meta-tested on a target dataset. In addition, a domain-generalized model, namely, a dual adversarial network, is designed to improve meta-learning-based methods under domain drift between different datasets and domains. Unlike previous meta-learning methods, the dual adversarial network receives two N-way-K-shot tasks from different domains at each episode. It leverages the features from the two domains for adversarial training to improve the domain adaptability of the model. During adversarial training, the proposed model uses a domain-knowledge generator to produce domain-specific knowledge and a domain discriminator to recognize the domain label of the produced knowledge. Extensive experiments are conducted to verify the effectiveness of the proposed setting and model. The experimental results show that model performance in the proposed setting improves by an average of 3.84% compared with that in cross-domain few-shot text classification. Furthermore, the dual adversarial network significantly outperforms five competitive baseline models, with an average improvement of 7.20%, and achieves an average improvement of 2.69% over the best baseline method.
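The episode construction described in the abstract, two N-way-K-shot tasks drawn from different domains, can be illustrated with a minimal sketch. This is not the authors' implementation: the corpus layout (domain, then class label, then texts), the function names `sample_task` and `sample_dual_episode`, the `n_query` parameter, and the toy data are all assumptions made for illustration, and the generator and discriminator themselves are not modeled.

```python
import random
from typing import Dict, List, Tuple

# Hypothetical layout (an assumption): domain -> class label -> list of texts.
Corpus = Dict[str, Dict[str, List[str]]]
Example = Tuple[str, int]  # (text, episode-local class index)
Task = Tuple[List[Example], List[Example]]  # (support set, query set)


def sample_task(classes: Dict[str, List[str]], n_way: int, k_shot: int,
                n_query: int, rng: random.Random) -> Task:
    """Sample one N-way-K-shot task (support and query sets) from one domain."""
    labels = rng.sample(sorted(classes), n_way)
    support: List[Example] = []
    query: List[Example] = []
    for idx, label in enumerate(labels):
        # Draw disjoint support and query texts for this class.
        texts = rng.sample(classes[label], k_shot + n_query)
        support += [(t, idx) for t in texts[:k_shot]]
        query += [(t, idx) for t in texts[k_shot:]]
    return support, query


def sample_dual_episode(corpus: Corpus, n_way: int, k_shot: int,
                        n_query: int, rng: random.Random) -> Dict[str, Task]:
    """Sample two tasks from two distinct domains for a single episode."""
    dom_a, dom_b = rng.sample(sorted(corpus), 2)
    return {d: sample_task(corpus[d], n_way, k_shot, n_query, rng)
            for d in (dom_a, dom_b)}


# Toy data, invented purely for illustration.
toy_corpus: Corpus = {
    domain: {f"{domain}_class{c}": [f"{domain} text {c}-{i}" for i in range(3)]
             for c in range(3)}
    for domain in ("news", "reviews", "dialogue")
}

episode = sample_dual_episode(toy_corpus, n_way=2, k_shot=1, n_query=1,
                              rng=random.Random(0))
for domain, (support, query) in episode.items():
    print(domain, len(support), len(query))
```

In a setup like the one the abstract describes, the domain name attached to each task would serve as the label that a domain discriminator tries to recognize; how the features are encoded and trained adversarially is left to the paper.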