Medical entity recognition and knowledge map relationship analysis of Chinese EMRs based on improved BiLSTM-CRF

计算机科学自然语言处理医学诊断人工智能病历鉴定（生物学）词汇注释情报检索精确性和召回率医学放射科语言学植物生物哲学

作者

Jia Ke,Weiji Wang,Xiaojun Chen,Jianping Gou,Yan Gao,Shuai Jin

出处

期刊：Computers & Electrical Engineering [Elsevier BV]
日期：2023-04-17 卷期号：108: 108709-108709 被引量：23

标识

DOI：10.1016/j.compeleceng.2023.108709

摘要

With the development of medical informatization, a large number of patients' electronic medical records (EMRs) have been accumulated in the hospital information system, which is characterized by multi-structured data form, diversified professional vocabulary categories and fuzzy vocabulary demarcation. Natural language processing (NLP) provides a silver lining for parsing electronic medical records, and the mainstream methods include dictionary-based methods, rule-based and statistical methods, and machine learning methods. Due to the semantic richness and structural diversity of Chinese text, Chinese EMRs analysis methods are still scarce compared with English EMRs. In order to overcome the problems of unstructured, multiple meanings of words and unclear word boundaries in text of Chinese electronic medical records, this paper proposes a medical entity recognition method based on RoFormerV2-BiLSTM-CRF fusion model, using BIO annotation method to annotate the recognized medical entities, and using knowledge graph to analyze the medical entity relationships identified in single patient medical record, multiple patient medical records respectively. The relationships between the medical entities identified in a single patient record and multiple patient records are analyzed using knowledge graphs. The experimental analysis was conducted on the expert-annotated dataset CCKS2019, and the results showed that the proposed method was effective for recognizing "Diseases and Diagnoses", "Laboratory Tests", "Imaging Examinations", "Anatomical Sites", "Drugs" and "Surgery" in the dataset. The average accuracy, recall and F1-Score of the proposed method for the identification of the six medical entities, are 84.8%, 83.5% and 83.9%, respectively, which were 5.3%, 8.6% and 6.8% higher than the traditional Word2Vec-BiLSTM-CRF model, and the existing Word2Vec-BiLSTM-CRF and BERT-BiLSTM-CRF models were iteratively trained with the same evaluation. The experimental results show that the proposed model performs better in medical entity recognition.

求助该文献

最长约 10秒，即可获得该文献文件

Medical entity recognition and knowledge map relationship analysis of Chinese EMRs based on improved BiLSTM-CRF

今日热心研友