计算机科学
一般化
关系抽取
基线(sea)
对象(语法)
嵌入
人工智能
特征提取
关系模型
特征(语言学)
人工神经网络
关系(数据库)
关系数据库
数据挖掘
模式识别(心理学)
机器学习
数学
数学分析
语言学
海洋学
哲学
地质学
作者
Weiqun Luo,Jiabao Wang,Xiangwei Yan,Guiyuan Jiang
标识
DOI:10.20944/preprints202309.1397.v1
摘要
To address the deficiency of existing relation extraction models in effectively extracting relational triplets pertaining to railway traffic knowledge in Tibet, this paper constructs a Tibet Railway Traffic text dataset and provides an enhanced relation extraction model. The proposed model incorporates subject feature enhancement and relational attention mechanisms. It leverages a pre-trained model as the embedding layer to obtain vector representations of text. Subsequently, the subject is extracted and its semantic information is augmented using an LSTM neural network. Furthermore, during object extraction, the multi-head attention mechanism enables the model to prioritize relations associated with the aforementioned features. Finally, objects are extracted based on the subjects and relations. The proposed method has been comprehensively evaluated on multiple datasets, including the Tibet Railway Traffic text dataset and two public datasets. The results on the Tibet dataset achieves an F1-score of 93.3\%, surpassing the baseline model CasRel by 0.8\%, indicating a superior applicability of the proposed model. On the other hand, the model achieves F1-scores of 91.1\% and 92.6\% on two public datasets, NYT and WebNLG, respectively, outperforming the baseline CasRel by 1.5\% and 0.8\%, which highlights the good generalization ability of the proposed model.
科研通智能强力驱动
Strongly Powered by AbleSci AI