Computer science
Scalability
Big data
Linkage (software)
Record linkage
Authors
Fuat Basik, Hakan Ferhatosmanoglu, Bugra Gedik
Source
Venue: International Conference on Management of Data
Date: 2020-06-11
Volume/Issue: 1181-1196
Citations: 2
Identifier
DOI:10.1145/3318464.3389761
Abstract
We present a scalable solution to link entities across mobility datasets using their spatio-temporal information. This is a fundamental problem in many applications such as linking user identities for security, understanding privacy limitations of location based services, or producing a unified dataset from multiple sources for urban planning. Such integrated datasets are also essential for service providers to optimise their services and improve business intelligence. In this paper, we first propose a mobility based representation and similarity computation for entities. An efficient matching process is then developed to identify the final linked pairs, with an automated mechanism to decide when to stop the linkage. We scale the process with a locality-sensitive hashing (LSH) based approach that significantly reduces candidate pairs for matching. To realize the effectiveness and efficiency of our techniques in practice, we introduce an algorithm called SLIM. In the experimental evaluation, SLIM outperforms the two existing state-of-the-art approaches in terms of precision and recall. Moreover, the LSH-based approach brings two to four orders of magnitude speedup.
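The abstract says that an LSH-based step reduces the number of candidate pairs before matching, but it does not give the details. The following is a minimal sketch of that general idea using generic MinHash with banding; it is not SLIM's actual representation, similarity measure, or hashing scheme. All names and parameters here (`tokens`, `minhash`, `candidate_pairs`, `cell_deg`, `window_s`, `num_hashes`, `bands`) are hypothetical and chosen only for illustration. Each entity is reduced to a set of (time window, grid cell) tokens, and only entities from different datasets that collide in at least one LSH bucket are kept as candidates.

```python
import hashlib
from collections import defaultdict
from itertools import combinations


def tokens(records, cell_deg=0.01, window_s=3600):
    """Map (timestamp_s, lat, lon) records to a set of (time window, cell) tokens.

    cell_deg and window_s are illustrative defaults, not values from the paper.
    """
    return {
        (int(t // window_s), int(lat // cell_deg), int(lon // cell_deg))
        for t, lat, lon in records
    }


def _hash(seed, token):
    """Deterministic 64-bit hash of a token under a given seed."""
    data = f"{seed}:{token}".encode()
    return int.from_bytes(hashlib.blake2b(data, digest_size=8).digest(), "big")


def minhash(token_set, num_hashes=32):
    """MinHash signature: the minimum hash value per seed over all tokens."""
    return tuple(
        min(_hash(seed, tok) for tok in token_set) for seed in range(num_hashes)
    )


def candidate_pairs(signatures, bands=8):
    """Return cross-dataset pairs that share at least one LSH band bucket.

    Keys of `signatures` are (dataset, entity_id) tuples (an assumption of
    this sketch); pairs within the same dataset are discarded.
    """
    rows = len(next(iter(signatures.values()))) // bands
    buckets = defaultdict(set)
    for entity, sig in signatures.items():
        for b in range(bands):
            buckets[(b, sig[b * rows:(b + 1) * rows])].add(entity)
    pairs = set()
    for members in buckets.values():
        for a, b in combinations(sorted(members), 2):
            if a[0] != b[0]:  # keep only pairs that span two datasets
                pairs.add((a, b))
    return pairs


# Toy data: A/u1 and B/x9 visit the same cells in the same hours; B/y3 does not.
traces = {
    ("A", "u1"): [(0, 41.015, 28.979), (4000, 41.025, 28.989), (8000, 41.035, 28.999)],
    ("B", "x9"): [(100, 41.0151, 28.9791), (4100, 41.0252, 28.9893), (8200, 41.0353, 28.9992)],
    ("B", "y3"): [(0, 40.105, 29.505), (4000, 40.115, 29.515), (8000, 40.125, 29.525)],
}
signatures = {entity: minhash(tokens(recs)) for entity, recs in traces.items()}
pairs = candidate_pairs(signatures)
```

In this toy run only the pair formed by `("A", "u1")` and `("B", "x9")` survives as a candidate, so a more expensive similarity computation would be applied to one pair instead of every cross-dataset combination. In general, increasing `bands` (with fewer rows per band) retains more candidates, while fewer, wider bands prune more aggressively.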