无人机
RGB颜色模型
水质
环境科学
遥感
人工智能
灌溉
质量(理念)
计算机科学
水文学(农业)
机器学习
工程类
地质学
生态学
生物
岩土工程
哲学
遗传学
认识论
作者
Seok Min Hong,Billie J. Morgan,Matthew Stocker,Jaclyn Smith,Moon S. Kim,Kyung Hwa Cho,Yakov Pachepsky
出处
期刊:Water Research
[Elsevier BV]
日期:2024-05-31
卷期号:260: 121861-121861
被引量:10
标识
DOI:10.1016/j.watres.2024.121861
摘要
The rapid and efficient quantification of Escherichia coli concentrations is crucial for monitoring water quality. Remote sensing techniques and machine learning algorithms have been used to detect E. coli in water and estimate its concentrations. The application of these approaches, however, is challenged by limited sample availability and unbalanced water quality datasets. In this study, we estimated the E. coli concentration in an irrigation pond in Maryland, USA, during the summer season using demosaiced natural color (red, green, and blue: RGB) imagery in the visible and infrared spectral ranges, and a set of 14 water quality parameters. We did this by deploying four machine learning models - Random Forest (RF), Gradient Boosting Machine (GBM), Extreme Gradient Boosting (XGB), and K-nearest Neighbor (KNN) - under three data utilization scenarios: water quality parameters only, combined water quality and small unmanned aircraft system (sUAS)-based RGB data, and RGB data only. To select the training and test datasets, we applied two data-splitting methods: ordinary and quantile data splitting. These methods provided a constant splitting ratio in each decile of the E. coli concentration distribution. Quantile data splitting resulted in better model performance metrics and smaller differences between the metrics for both the training and testing datasets. When trained with quantile data splitting after hyperparameter optimization, models RF, GBM, and XGB had R
科研通智能强力驱动
Strongly Powered by AbleSci AI