指针(用户界面)
计算机科学
自然语言处理
人工智能
作者
Xiaohui Cui,Chao Song,Dongmei Li,Xiaolong Qu,Jiao Long,Yang Yu,Hanchao Zhang
出处
期刊:Computers, materials & continua
日期:2024-01-01
卷期号:78 (3): 3603-3618
被引量:1
标识
DOI:10.32604/cmc.2024.047321
摘要
Named Entity Recognition (NER) stands as a fundamental task within the field of biomedical text mining, aiming to extract specific types of entities such as genes, proteins, and diseases from complex biomedical texts and categorize them into predefined entity types.This process can provide basic support for the automatic construction of knowledge bases.In contrast to general texts, biomedical texts frequently contain numerous nested entities and local dependencies among these entities, presenting significant challenges to prevailing NER models.To address these issues, we propose a novel Chinese nested biomedical NER model based on RoBERTa and Global Pointer (RoBGP).Our model initially utilizes the RoBERTa-wwm-ext-large pretrained language model to dynamically generate word-level initial vectors.It then incorporates a Bidirectional Long Short-Term Memory network for capturing bidirectional semantic information, effectively addressing the issue of long-distance dependencies.Furthermore, the Global Pointer model is employed to comprehensively recognize all nested entities in the text.We conduct extensive experiments on the Chinese medical dataset CMeEE and the results demonstrate the superior performance of RoBGP over several baseline models.This research confirms the effectiveness of RoBGP in Chinese biomedical NER, providing reliable technical support for biomedical information extraction and knowledge base construction.
科研通智能强力驱动
Strongly Powered by AbleSci AI