任务(项目管理)
领域(数学分析)
自然语言处理
注释
计算机科学
人工智能
信息抽取
关系抽取
情报检索
数据挖掘
数据科学
数学
数学分析
经济
管理
作者
Tongfeng Guan,Hongying Zan,Xiabing Zhou,Hongfei Xu,Kunli Zhang
标识
DOI:10.1007/978-3-030-60450-9_22
摘要
In this paper, we present the Chinese Medical Information Extraction (CMeIE) dataset, consisting of 28, 008 sentences, 85, 282 triplets, 11 entities, and 44 relations derived from medical textbooks and clinical practices, constructed by several rounds of manual annotation. Additionally, we evaluate performances of the most recent state-of-the-art frameworks and pre-trained language models for the joint extraction of entities and relations task on the CMeIE dataset. Experiment results show that even these most advanced models still have a large space to improve on our dataset; currently, the best F1 score on the dataset is 58.44%. Our analysis points out several challenges and multiple potential future research directions for the task specialized in the medical domain.
科研通智能强力驱动
Strongly Powered by AbleSci AI