层次聚类
持久同源性
薄雾
欧几里德距离
相似性(几何)
欧几里德几何
拓扑(电路)
空气质量指数
数据挖掘
聚类分析
相似性度量
计算机科学
地理
人工智能
数学
算法
气象学
图像(数学)
组合数学
几何学
作者
Nur Fariha Syaqina Zulkepli,Mohd Salmi Md Noorani,Fatimah Abdul Razak,Munira Ismail,Mohd Sharizal Alias
标识
DOI:10.1016/j.jenvman.2022.114434
摘要
Haze has been a major issue afflicting Southeast Asian countries, including Malaysia, for the past few decades. Hierarchical agglomerative cluster analysis (HACA) is commonly used to evaluate the spatial behavior between areas in which pollutants interact. Typically, using HACA, the Euclidean distance acts as the dissimilarity measure and air quality monitoring stations are grouped according to this measure, thus revealing the most polluted areas. In this study, a framework for the hybridization of the HACA technique is proposed by considering the topological similarity (Wasserstein distance) between stations to evaluate the spatial patterns of the affected areas by haze episodes. For this, a tool in the topological data analysis (TDA), namely, persistent homology, is used to extract essential topological features hidden in the dataset. The performance of the proposed method is compared with that of traditional HACA and evaluated based on its ability to categorize areas according to the exceedance level of the particulate matter (PM10). Results show that additional topological features have yielded better accuracy compared to without the case that does not consider topological features. The cluster validity indices are computed to verify the results, and the proposed method outperforms the traditional method, suggesting a practical alternative approach for assessing the similarity in air pollution behaviors based on topological characterizations.
科研通智能强力驱动
Strongly Powered by AbleSci AI