计算机科学
关系(数据库)
利用
相似性(几何)
人工智能
自然语言处理
连接(主束)
语义关系
语义相似性
深度学习
联想(心理学)
情报检索
基础(线性代数)
数据挖掘
数学
图像(数学)
心理学
几何学
计算机安全
认知
神经科学
心理治疗师
作者
Liang Chen,Shuo Xu,Lijun Zhu,Jing Zhang,Guancan Yang,Haiyun Xu
标识
DOI:10.1016/j.joi.2022.101312
摘要
The deep learning has become an important technique for semantic relation classification in patent texts. Previous studies just borrowed the relevant models from generic texts to patent texts while keeping structure of the models unchanged. Due to significant distinctions between patent texts and generic ones, this enables the performance of these models in the patent texts to be reduced dramatically. To highlight these distinct characteristics in patent texts, seven annotated corpora from different fields are comprehensively compared in terms of several indicators for linguistic characteristics. Then, a deep learning based method is proposed to benefit from these characteristics. Our method exploits the information from other similar entity pairs as well as that from the sentences mentioning a focal entity pair. The latter stems from the conventional practices, and the former from our meaningful observation: the stronger the connection between two entity pairs is, the more likely they belong to the same relation type. To measure quantitatively the connection between two entity pairs, a similarity indicator on the basis of association rules is raised. Extensive experiments on the corpora of TFH-2020 and ChemProt demonstrate that our method for semantic relation classification is capable of benefiting from characteristic of patent texts.
科研通智能强力驱动
Strongly Powered by AbleSci AI