计算机科学
关系抽取
稳健性(进化)
水准点(测量)
关系(数据库)
一般化
人工智能
自然语言处理
对抗制
判决
资源(消歧)
情报检索
数据挖掘
数学
数学分析
基因
化学
生物化学
地理
计算机网络
大地测量学
作者
Ruoyu Zhang,Yanzeng Li,Zhang Min-hao,Lei Zou
标识
DOI:10.1145/3539618.3591984
摘要
Recent years have witnessed the transition from sentence-level to document-level in relation extraction (RE), with new formulation, new methods and new insights. Yet, the fundamental concept, mention, is not well-considered and well-defined. Current datasets usually use automatically-detected named entities as mentions, which leads to the missing reference problem. We show that such phenomenon hinders models' reasoning abilities. To address it, we propose to incorporate coreferences (e.g. pronouns and common nouns) into mentions, based on which we refine and re-annotate the widely-used DocRED benchmark as R-DocRED. We evaluate various methods and conduct thorough experiments to demonstrate the efficacy of our formula. Specifically, the results indicate that incorporating coreferences helps reduce the long-term dependencies, further improving models' robustness and generalization under adversarial and low-resource settings. The new dataset is made publicly available for future research.
科研通智能强力驱动
Strongly Powered by AbleSci AI