Start From Zero: Triple Set Prediction for Automatic Knowledge Graph Completion

计算机科学缺少数据集合（抽象数据类型）嵌入公制（单位）图形排名（信息检索）任务（项目管理）人工智能理论计算机科学数据挖掘机器学习运营管理管理经济程序设计语言

作者

Wen Zhang,Yajing Xu,Peng Ye,Zhiwei Huang,Zezhong Xu,Jiaoyan Chen,Jeff Z. Pan,Huajun Chen

出处

期刊：IEEE Transactions on Knowledge and Data Engineering [Institute of Electrical and Electronics Engineers]
日期：2024-01-01 卷期号：: 1-14

链接

arxiv.org arxiv.orgdoi.org

标识

DOI：10.1109/tkde.2024.3399832

摘要

Knowledge graph (KG) completion aims to find out missing triples in a KG. Some tasks, such as link prediction and instance completion, have been proposed for KG completion. They are triple-level tasks with some elements in a missing triple given to predict the missing element of the triple. However, knowing some elements of the missing triple in advance is not always a realistic setting. In this paper, we propose a novel graph-level automatic KG completion task called Triple Set Prediction (TSP) which assumes none of the elements in the missing triples is given. TSP is to predict a set of missing triples given a set of known triples. To properly and accurately evaluate this new task, we propose 4 evaluation metrics including 3 classification metrics and 1 ranking metric, considering both the partial-open-world and the closed-world assumptions. Furthermore, to tackle the huge candidate triples for prediction, we propose a novel and efficient subgraph-based method GPHT that can predict the triple set fast. To fairly compare the TSP results, we also propose two types of methods RuleTensor-TSP and KGE-TSP applying the existing rule- and embedding-based methods for TSP as baselines. During experiments, we evaluate the proposed methods on two datasets extracted from Wikidata following the relation-similarity partial-open-world assumption proposed by us, and also create a complete family data set to evaluate TSP results following the closed-world assumption. Results prove that the methods can successfully generate a set of missing triples and achieve reasonable scores on the new task, and GPHT performs better than the baselines with significantly shorter prediction time. The datasets and code for experiments are available at https://github.com/zjukg/GPHT-for-TSP.

求助该文献

最长约 10秒，即可获得该文献文件

Start From Zero: Triple Set Prediction for Automatic Knowledge Graph Completion

今日热心研友