计算机科学
条件随机场
人工智能
水准点(测量)
性格(数学)
嵌入
命名实体识别
特征(语言学)
自然语言处理
任务(项目管理)
深度学习
语言学
哲学
几何学
数学
管理
大地测量学
经济
地理
作者
Ying An,Xianyun Xia,Xianlai Chen,Fang-Xiang Wu,Jianxin Wang
标识
DOI:10.1016/j.artmed.2022.102282
摘要
Clinical named entity recognition (CNER) is a fundamental step for many clinical Natural Language Processing (NLP) systems, which aims to recognize and classify clinical entities such as diseases, symptoms, exams, body parts and treatments in clinical free texts. In recent years, with the development of deep learning technology, deep neural networks (DNNs) have been widely used in Chinese clinical named entity recognition and many other clinical NLP tasks. However, these state-of-the-art models failed to make full use of the global information and multi-level semantic features in clinical texts. We design an improved character-level representation approach which integrates the character embedding and the character-label embedding to enhance the specificity and diversity of feature representations. Then, a multi-head self-attention based Bi-directional Long Short-Term Memory Conditional Random Field (MUSA-BiLSTM-CRF) model is proposed. By introducing the multi-head self-attention and combining a medical dictionary, the model can more effectively capture the weight relationships between characters and multi-level semantic feature information, which is expected to greatly improve the performance of Chinese clinical named entity recognition. We evaluate our model on two CCKS challenge (CCKS2017 Task 2 and CCKS2018 Task 1) benchmark datasets and the experimental results show that our proposed model achieves the best performance competing with the state-of-the-art DNN based methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI