计算机科学
接头(建筑物)
关系抽取
质量(理念)
人工智能
信息抽取
数据挖掘
知识抽取
数据提取
情报检索
自然语言处理
机器学习
梅德林
工程类
建筑工程
哲学
认识论
法学
政治学
作者
Jigen Luo,Yang Yuan,Jianqiang Du,Qiang Shi,Wangping Xiong,Qiming Zheng
标识
DOI:10.1145/3573428.3573668
摘要
TCM texts are rich in evidence-based information, and a crucial step in knowledge mining is using high-tech tools to organize and store TCM texts in a systematic manner. Knowledge graphs are better suited to organizing and preserving the knowledge of TCM texts with complex relationships than typical databases are. Building high-quality knowledge maps requires accurate and efficient entity relationship extraction, and completely automated entity relationship extraction necessitates the creation of a sizable amount of high-quality corpus data, which increases expenses and lowers productivity. Because the same entity can generate many relations in the joint extraction of TCM entity relations and there are insufficient corpus data, these issues must be addressed. The joint extraction model of TCM entity interactions proposed in this paper is based on deep learning and data augmentation. Using a multi-head selective bidirectional long and short-term memory network (multi-head-BILSTM), the relationship overlap problem is first solved, and the data is then enhanced using five mechanisms: entity replacement, random addition, random deletion, random replacement, and integrated enhancement. We quantitatively assess the benefits and drawbacks of several relationship extraction algorithms as well as the performance improvements of TCM entity relationship joint extraction brought about by five alternative data augmentation mechanisms. In conclusion, our research has the potential to significantly enhance the efficiency of joint TCM entity relationship extraction.
科研通智能强力驱动
Strongly Powered by AbleSci AI