A BERT-Based Named Entity Recognition in Chinese Electronic Medical Record
计算机科学
命名实体识别
自然语言处理
人工智能
情报检索
工程类
系统工程
任务(项目管理)
作者
Qingchuan Wang,Haihong E
标识
DOI:10.1145/3436369.3436390
摘要
Named entity recognition, aiming at identifying and classifying named entity mentioned in the structured or unstructured text, is a fundamental subtask for information extraction in natural language processing (NLP). With the development of electronic medical records, obtaining the key and effective information in electronic document through named entity identification has become an increasingly popular research direction. In this article, we adapt a recently introduced pre-trained language model BERT for named entity recognition in electronic medical records to solve the problem of missing context information and we add an extra mechanism to capture the relationship between words. Based on this, (1) the entities can be represented by sentence-level vector, with the forward as well as backward information of the sentence, which can be directly used by downstream tasks; (2) the model acquires the representation of word in context and learn the potential relation between words to decrease the influence of inconsistent entity markup problem of a text. We conduct experiments an electronic medical record dataset proposed by China Conference on Knowledge Graph and Semantic Computing in 2019. The experimental result shows that our proposed method has an improvement compared with the traditional methods.