计算机科学
信息抽取
解析
任务(项目管理)
过程(计算)
简单(哲学)
实体链接
自然语言处理
萃取(化学)
情报检索
人工智能
嵌套循环联接
数据挖掘
程序设计语言
知识库
管理
化学
经济
哲学
认识论
色谱法
作者
Jiawei Wang,Xin Zheng,Qiang Yang,Jianfeng Qu,Jiajie Xu,Zhigang Chen,Zhixu Li
出处
期刊:Communications in computer and information science
日期:2021-01-01
卷期号:: 185-197
标识
DOI:10.1007/978-981-16-6471-7_14
摘要
Open Information Extraction is a crucial task in natural language processing with wide applications. Existing efforts only work on extracting simple flat triplets that are not minimized, which neglect triplets of other kinds and their nested combinations. As a result, they cannot provide comprehensive extraction results for its downstream tasks. In this paper, we define three more fine-grained types of triplets, and also pay attention to the nested combination of these triplets. Particular, we propose a novel end-to-end joint extraction model, which identifies the basic semantic elements, comprehensive types of triplets, as well as their nested combinations from plain texts jointly. In this way, information is shared more thoroughly in the whole parsing process, which also lets the model achieve more fine-grained knowledge extraction without relying on external NLP tools or resources. Our empirical study on datasets of two domains, Building Codes and Biomedicine, demonstrates the effectiveness of our model comparing to state-of-the-art approaches.
科研通智能强力驱动
Strongly Powered by AbleSci AI