计算机科学
质量(理念)
可靠性(半导体)
集合(抽象数据类型)
数据挖掘
对象(语法)
财产(哲学)
订单(交换)
人工智能
物理
哲学
经济
功率(物理)
认识论
程序设计语言
量子力学
财务
作者
Daniel Faria,Alfio Ferrara,Ernesto Jiménez-Ruiz,Stefano Montanelli,Cátia Pesquita
标识
DOI:10.1017/s0269888920000363
摘要
Abstract The quality of a dataset used for evaluating data linking methods, techniques, and tools depends on the availability of a set of mappings, called reference alignment , that is known to be correct. In particular, it is crucial that mappings effectively represent relations between pairs of entities that are indeed similar due to the fact that they denote the same object. Since the reliability of mappings is decisive in order to perform a fair evaluation of automatic linking methods and tools, we call this property of mappings as mapping fairness . In this article, we propose a crowd-based approach, called Crowd Quality ( CQ ), for assessing the quality of data linking datasets by measuring the fairness of the mappings in the reference alignment. Moreover, we present a real experiment, where we evaluate two state-of-the-art data linking tools before and after the refinement of the reference alignment based on the CQ approach, in order to present the benefits deriving from the crowd assessment of mapping fairness.
科研通智能强力驱动
Strongly Powered by AbleSci AI