计算机科学
自然语言处理
判决
人工智能
命名实体识别
模棱两可
利用
实体链接
光学(聚焦)
词(群论)
背景(考古学)
图形
情报检索
任务(项目管理)
语言学
古生物学
哲学
经济
物理
管理
光学
程序设计语言
生物
理论计算机科学
计算机安全
知识库
作者
Yiting Yu,Zanbo Wang,Wei Wei,Ruihan Zhang,Xian-Ling Mao,Shanshan Feng,Fei Wang,Zhiyong He,Sheng Jiang
标识
DOI:10.1016/j.knosys.2023.111266
摘要
Named entity recognition (NER, also known as entity chunking/extraction) is a fundamental sub-task of information extraction, which aims at identifying named entities from an unstructured text into pre-defined classes. Most of the existing works mainly focus on modeling local-context dependencies in a single sentence for entity type prediction. However, they may neglect the clues derived from other sentences within a document, and thus suffer from the sentence-level inherent ambiguity issue, which may make their performance drop to some extent. To this end, we propose a Global Context enhanced Document-level NER (GCDoc) model for NER to fully exploit the global contextual information of a document in different levels, i.e., word-level and sentence-level. Specifically, GCDoc constructs a document graph to capture the global dependencies of words for enriching the representations of each word in word-level. Then, it encodes the adjacent sentences for exploring the contexts across sentences to enhance the representation of the current sentence via the specially devised attention mechanism. Extensive experiments on two benchmark NER datasets (i.e., CoNLL 2003 and Onenotes 5.0 English dataset) demonstrate the effectiveness of our proposed model, as compared to the competitive baselines.
科研通智能强力驱动
Strongly Powered by AbleSci AI