计算机科学
关系抽取
判决
粒度
人工智能
关系(数据库)
提取器
水准点(测量)
噪音(视频)
光学(聚焦)
任务(项目管理)
人工神经网络
词(群论)
自然语言处理
机器学习
学习迁移
信息抽取
数据挖掘
地理
工程类
管理
经济
哲学
大地测量学
物理
光学
图像(数学)
操作系统
语言学
工艺工程
作者
Xinsong Zhang,Tianyi Liu,Pengshuai Li,Weijia Jia,Hai Zhao
出处
期刊:IEEE Transactions on Knowledge and Data Engineering
[Institute of Electrical and Electronics Engineers]
日期:2021-09-01
卷期号:33 (9): 3297-3310
被引量:12
标识
DOI:10.1109/tkde.2020.2964747
摘要
Distant supervision is widely used to extract relational facts with automatically labeled datasets to reduce high cost of human annotation. However, current distantly supervised methods suffer from the common problems of word-level and sentence-level noises, which come from a large proportion of irrelevant words in a sentence and inaccurate relation labels for numerous sentences. The problems lead to unacceptable precision in relation extraction and are critical for the success of using distant supervision. In this paper, we propose a novel and robust neural approach to deal with both problems by reducing influences of the multi-granularity noises. Three levels of noises from word, sentence until knowledge type are carefully considered in this work. We first initiate a question-answering based relation extractor (QARE) to remove noisy words in a sentence. Then we use multi-focus multi-instance learning (MMIL) to alleviate the effects of sentence-level noise by utilizing wrongly labeled sentences properly. Finally, to enhance our method against all the noises, we initialize parameters in our method with a priori knowledge learned from the relevant task of entity type classification by transfer learning. Extensive experiments on both existing benchmark and an improved larger dataset demonstrate that our proposed approach remarkably achieves new state-of-the-art performance.
科研通智能强力驱动
Strongly Powered by AbleSci AI