文档
计算机科学
关系抽取
管道(软件)
背景(考古学)
任务(项目管理)
过程(计算)
关系(数据库)
人工智能
自然语言处理
F1得分
信息抽取
实体链接
精确性和召回率
情报检索
机器学习
数据挖掘
知识库
工程类
程序设计语言
古生物学
系统工程
生物
作者
Jiajing Liu,Hanbin Luo,Weili Fang,Peter E.D. Love
标识
DOI:10.1016/j.aei.2023.102194
摘要
Typically named entity recognition (NER) and relation extraction (RE) from safety documentation (e.g., accident reports) adopt a pipeline processing approach whereby tasks are split into two sub-tasks. As a result, error propagation occurs between components, and useful information from one task may go unexploited by the other. Additionally, training sets to perform NER and RE from safety documentation are often limited and context-specific. Thus, our research addresses the following question: How can we accurately identify entities and extract relations from safety documentation using limited training sets? This paper utilizes 'contrastive learning' to tackle our research question. It proposes a contrastive learning-based cascade binary tagging framework (CasRel) to automatically and synchronously extract entities and relations from safety documents. A five-fold cross-validation process is used to validate the effectiveness and feasibility of our approach. Results from the validation process achieve an average precision of 77.8%, recall of 58.7%, and F1-score of 66.9%, outperforming CasRel with an increase of about 10% in precision, 5% in recall, and 7% in F1-score. Thus, our approach can accurately recognize entities and extract relations from safety documentation. The contributions of our study are twofold: (1) an improved unified model is developed to recognize and extract the entity and relation from safety documents to reduce error propagation and improve its accuracy; and (2) the concept of 'contrastive learning' is introduced in the design of the joint entity and relation extraction model with limited training sets.
科研通智能强力驱动
Strongly Powered by AbleSci AI