空气质量指数
自编码
多元统计
概率逻辑
时间序列
计算机科学
生成模型
期限(时间)
系列(地层学)
深度学习
环境科学
人工智能
机器学习
气象学
数据挖掘
统计
数学
生成语法
地理
古生物学
物理
量子力学
生物
作者
Cooper Loughlin,Dimitris G. Manolakis,Vinay K. Ingle
摘要
Monitoring of air pollutants across space and time is critical in understanding pollution trends and reporting air quality. The Air Quality Index (AQI) is a tool used to communicate air quality that incorporates atmospheric concentrations of five major pollution indicators: ground-level ozone, particulate matter, carbon monoxide, sulfur dioxide, and nitrogen dioxide. The ability to accurately forecast these concentrations and identify unusual levels is of particular importance. In this work, we develop a generative time series model for air quality indicators and use it for long and short-term probabilistic forecasts. Air quality data are multivariate and exhibit high variability across indicators in both space and time. Marginal indicator distributions are typically skewed and contain substantial zeros, while indicator-wise cross-correlations can be highly non-linear. We find that hourly measurements additionally exhibit substantial temporal cross-correlation, long-term dependence, and daily periodicity. To capture these complexities, we employ a recurrent extension of the variational autoencoder (VAE) to sequential data. The VAE is a generative neural network architecture capable of learning complex, high dimensional manifolds on which data are distributed. Furthermore, recurrent architectures can capture non-linear and long-term temporal qualities of time series data. We train the proposed time series model on historical air quality measurements at multiple locations and demonstrate its ability to capture observed indicator-wise and temporal complexities. We additionally use the trained model to compute probabilistic forecasts and credible intervals of air quality indicators.
科研通智能强力驱动
Strongly Powered by AbleSci AI