计算机科学
数据挖掘
聚类分析
传感器融合
多源
数据集
热点(地质)
人工智能
数学
地球物理学
统计
地质学
作者
Li Cai,Haoyu Wang,Cong Sha,Fang Jiang,Yihang Zhang,Wei Zhou
出处
期刊:IEEE Transactions on Knowledge and Data Engineering
[Institute of Electrical and Electronics Engineers]
日期:2021-01-01
卷期号:: 1-1
被引量:1
标识
DOI:10.1109/tkde.2021.3109581
摘要
Urban hotspots reflect the degree of residents' travel gathering. The study of urban hotspots has important values for urban infrastructure planning, public security and other aspects. In existing researches, single-source location data and density-based clustering algorithms are used to mine hotspots. Due to the one-sidedness of using the single-source data, the mining of hotspots based on multi-source location data fusion has become a hot topic. Multi-source location data fusion requires a quantity balance between the data sets to be fused, because several famous clustering algorithms cannot handle multi-source imbalanced data sets. To solve this problem, we propose a novel framework to mine urban hotspots. First, we construct a data imputation model for the sparse data set so that reducing the difference in quantity between two types of data sets. Then, a clustering algorithm for imbalanced data sets is proposed, and a novel evaluation metric is designed to verify the effectiveness of clustering results. The experiment uses real data sets including POI data, check-in data and GPS trajectory data. The results show that the proposed method discovers all urban hotspots formed by fused imbalanced data sets, and it is more accurate and efficient than the state-of-the-art algorithms.
科研通智能强力驱动
Strongly Powered by AbleSci AI