Predicting PM2.5 levels and exceedance days using machine learning methods

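Support vector machine, Random forest, Decision tree, Environmental science, Artificial neural network, Machine learning, Relative humidity, Meteorology, Air quality index, Artificial intelligence, Computer science, Geography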

Authors

Ziqi Gao, Khanh Do, Zongrun Li, Xiangyu Jiang, Kamal Jyoti Maji, Cesunica E. Ivey, Armistead G. Russell

Source

Journal: Atmospheric Environment [Elsevier]
Date: 2024-02-11 Volume/Issue: 323: 120396-120396 Citations: 5

Identifiers

DOI: 10.1016/j.atmosenv.2024.120396

Abstract

Machine learning methods are increasingly being used in the field of air quality research to investigate the relationship between air pollutant levels, emissions, and meteorological changes over time. This research is used for both scientific investigation, and policy assessment and development. However, there is a lack of studies that have compared the performance of different machine learning methods. To address this gap, this paper employed various machine learning techniques, including decision tree, random forest (RF), support vector machine (SVM), support vector regression (SVR), k-nearest neighbor, neural network, and Gaussian process regression, to predict daily average PM2.5 levels and the number of days with PM2.5 exceedance in the South Coast Air Basin of California from 2000 to 2019. The models were trained using meteorological factors, estimated emissions, and large-scale climate indices as inputs. The SVR model demonstrated the highest predictive accuracy for PM2.5 levels and the SVM model gave the most accurate results for predicting the number of days with PM2.5 exceedances. Conversely, the decision tree model performed the least accurately. The results also showed that emissions have a greater impact on PM2.5 levels over time compared to meteorological factors, though meteorology is responsible for daily variability. The most important meteorological factors were identified as surface relative humidity and relative humidity at 850 mbars, which are related to partitioning, cloud cover and wet deposition. We conducted sensitivity tests on the model's response to emissions and meteorological factors. The predicted PM2.5 from RF and SVR showed large correlations with emissions at the early period (2000–2010). However, the changes were minimal in more recent years (2011–2019), implying that there are biases in machine learning models, in which the models consistently predict the minimum PM2.5 levels at a baseline.
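The abstract's second prediction target, the number of days with PM2.5 exceedance, can be derived from daily averages by counting the days above a threshold in each year. The following is a minimal sketch of that counting step only, not the authors' method. The threshold of 35 µg/m³ (the US EPA 24-hour PM2.5 standard) is an assumption, since the abstract does not say which threshold the study used; the function name `count_exceedance_days` and the sample values are hypothetical.

```python
from collections import Counter
from datetime import date

# Assumed threshold: the US EPA 24-hour PM2.5 standard (35 µg/m³).
# The abstract does not state which threshold the study used.
EXCEEDANCE_THRESHOLD = 35.0


def count_exceedance_days(daily_pm25, threshold=EXCEEDANCE_THRESHOLD):
    """Return {year: number of days whose daily mean PM2.5 is above threshold}."""
    counts = Counter()
    for day, value in daily_pm25.items():
        # A bool adds as 0 or 1, so years with no exceedance still appear with 0.
        counts[day.year] += value > threshold
    return dict(counts)


# Hypothetical daily averages (µg/m³), for illustration only.
sample = {
    date(2000, 1, 1): 41.2,
    date(2000, 1, 2): 35.0,  # equal to the threshold, so not an exceedance
    date(2000, 1, 3): 52.7,
    date(2019, 1, 1): 12.4,
    date(2019, 1, 2): 36.1,
}
exceedances = count_exceedance_days(sample)
```

The same function could be applied to observed and to model-predicted daily values, so that yearly exceedance counts from each model can be compared against observations.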
