随机森林
水质
均方误差
环境科学
流域
叶绿素a
总有机碳
决定系数
溶解有机碳
氮气
人口
相关系数
水文学(农业)
统计
数学
机器学习
生态学
地理
计算机科学
化学
环境化学
地图学
工程类
生物化学
人口学
有机化学
岩土工程
社会学
生物
作者
Kyung‐Min Kim,Johng‐Hwa Ahn
标识
DOI:10.1016/j.jenvman.2022.115636
摘要
This study developed a model to predict concentrations of chlorophyll-a ([Chl-a]) as a proxy for algal population with data from multiple monitoring stations in the Han river basin, by using machine-learning predictive models, then analyzed the relationship between [Chl-a] and the input variables of the optimized model. Daily water quality and meteorological data from 2012 to 2020 were collected from the real-time water quality information system and the meteorological administration of Korea. To quantify model accuracy, the coefficient of determination, root mean square error, and mean absolute error were applied. Among random forest (RF), support vector machine, and artificial neural network, the RF with random dataset showed the highest accuracy. The RF was optimized when 78 trees were applied to the model. Input variables for the best RF model were total organic carbon (feature importance: 27%), total nitrogen (19%), pH (13%), water temperature (8%), total phosphorus (8%), electrical conductivity (7%), dissolved oxygen (6%), minimum air temperature (AT) (4%), mean AT (3%), and maximum AT (3%). The feature-importance analysis showed that total organic carbon was the most important variable to predict [Chl-a] in the Han river basin. Total nitrogen was a more important variable than total phosphorus.
科研通智能强力驱动
Strongly Powered by AbleSci AI