计算机科学
背景(考古学)
编码器
模棱两可
光学(聚焦)
匹配(统计)
人工智能
代表(政治)
情报检索
多样性(控制论)
深度学习
自然语言处理
答疑
程序设计语言
数学
古生物学
统计
物理
政治
法学
政治学
光学
生物
操作系统
作者
Tiantian Zhu,Yang Qin,Ming Feng,Qingcai Chen,Baotian Hu,Yang Xiang
出处
期刊:IEEE/ACM transactions on audio, speech, and language processing
[Institute of Electrical and Electronics Engineers]
日期:2024-01-01
卷期号:32: 374-385
标识
DOI:10.1109/taslp.2023.3331149
摘要
Recent research tends to address the biomedical entity linking problem in a unified framework solely based on surface form matching between mentions and entities. Specifically, these methods focus on addressing the variety challenge of the heterogeneous naming of biomedical concepts. Yet, the ambiguity challenge that the same word under different contexts can be used to refer to distinct concepts is usually ignored. To address this challenge, we propose BioPRO, a two-stage entity linking algorithm to enhance the biomedical entity representations based on context-infused prompt learning. The first stage includes a coarse-grained retrieval from a representation space defined by a bi-encoder that independently embeds the mention and entity's surface forms. Unlike previous one-model-fits-all systems, each candidate is then re-ranked with a fine-grained encoder based on prompt-tuning that sufficiently stimulates knowledge in contextual information of mentions and entities. Furthermore, the trained fine-grained encoder can be utilized to generate deep representations of bio-entities and boost candidate retrieval in the first stage. Extensive experiments show that our model achieves promising performance improvements compared with several state-of-the-art (SOTA) techniques on 4 biomedical corpora. We also observe by cases that the proposed context-infused prompt-tuning strategy is effective in solving both the variety and ambiguity challenges in the linking task.
科研通智能强力驱动
Strongly Powered by AbleSci AI