Evidence Reasoning and Curriculum Learning for Document-level Relation Extraction

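Relation extraction; Computer science; Relation (database); Sentence; Artificial intelligence; Task (project management); Natural language processing; Information extraction; Reinforcement learning; Information retrieval; Data mining; Management; Economics
Authors
Tianyu Xu, Jianfeng Qu, Wen Hua, Zhixu Li, Jiajie Xu, An Liu, Lei Zhao, Xiaofang Zhou
Source
Journal: IEEE Transactions on Knowledge and Data Engineering [Institute of Electrical and Electronics Engineers]
Volume/issue: : 1-14
Identifier
DOI:10.1109/tkde.2023.3292974
Abstract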

Document-level Relation Extraction (RE) is a promising task that aims to identify the relations of multiple entity pairs in a document. Compared with its sentence-level counterpart, it raises two significant challenges: a) In most cases, a relational fact can be adequately expressed by a small subset of sentences from the document, namely the evidence. However, traditional methods cannot model the strong semantic correlations between evidence sentences that collaborate to describe a specific relation; b) The data for this task is extremely long-tailed, with too many NA instances and imbalanced relation types. Such data can bias the RE model's predictions for tail categories toward the head categories. In this paper, we present a novel Evidence reasoning and Curriculum learning method for DocRE (DRE-EC) to address these challenges. In particular, we first formulate evidence extraction as a sequential decision problem through a crafted reinforcement learning mechanism with an efficient path searching strategy to reduce the action space. Providing the evidence for each entity pair in advance, as a custom-filtered document, helps infer the relations better. To address the long-tail issue, we further develop a hybrid curriculum learning method at the NA level (NC) and the relation level (RC) with our customized difficulty measure score. In NC, the NA samples are scheduled in an easy-to-hard scheme and gradually added, so that the data distribution shifts from ideal and balanced to real and unbalanced. In RC, the scheme is switched to hard-to-easy to enhance the hard and tail samples. In addition, we propose a new Equalization adaptive Focal Loss (EFLoss) that can adjust to the changing data distribution and focus more on the tail categories. We conduct various experiments on two document-level RE benchmarks and achieve a remarkable improvement over previous competitive baselines. Furthermore, we provide detailed analyses of the advantages and effectiveness of our method.
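
The NA-level curriculum described in the abstract can be illustrated with a minimal sketch. The abstract gives neither the difficulty measure nor the form of EFLoss, so everything below is an assumption made for illustration: `na_curriculum`, the per-sample `difficulty` scores, and the linear pacing are hypothetical, and `focal_loss` is the standard focal loss, not the paper's EFLoss.

```python
import math


def na_curriculum(positives, na_samples, difficulty, num_stages):
    """Yield one training pool per stage; NA samples enter easiest first.

    `difficulty` is a hypothetical scoring function (lower means easier).
    The paper's own difficulty measure is not given in the abstract.
    """
    ordered = sorted(na_samples, key=difficulty)  # easy -> hard
    for stage in range(1, num_stages + 1):
        # Linear pacing (an assumption): the last stage contains every
        # NA sample, so the pool ends at the real, unbalanced distribution.
        cutoff = math.ceil(len(ordered) * stage / num_stages)
        yield positives + ordered[:cutoff]


def focal_loss(p_true, gamma=2.0):
    """Standard focal loss on the probability given to the true class.

    Shown only as the base that an adaptive variant could build on;
    this is not the EFLoss proposed in the paper.
    """
    return -((1.0 - p_true) ** gamma) * math.log(p_true)


# Toy data: two relation-bearing samples and four NA samples with
# made-up difficulty scores.
positives = ["p1", "p2"]
na_scores = {"na_a": 0.1, "na_b": 0.9, "na_c": 0.4, "na_d": 0.7}
stages = list(
    na_curriculum(positives, list(na_scores), na_scores.get, num_stages=2)
)
```

With these toy scores, the first stage pairs the positives with the two easiest NA samples and the second stage adds the remaining two. The focal term down-weights confident predictions, which is why such losses are often used for long-tailed data.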