缺少数据
计算机科学
插补(统计学)
分水岭
数据挖掘
水质
背景(考古学)
工作(物理)
环境数据
水资源
机器学习
数据质量
工程类
运营管理
法学
政治学
机械工程
生态学
古生物学
公制(单位)
生物
作者
Marcos Pastorini,Rafael Rodríguez,Lorena Etcheverry,Alberto Castro,Angela Gorgoglione
标识
DOI:10.1016/j.scitotenv.2024.171773
摘要
In water resources management, new computational capabilities have made it possible to develop integrated models to jointly analyze climatic conditions and water quantity/quality of the entire watershed system. Although the value of this integrated approach has been demonstrated so far, the limited availability of field data may hinder its applicability by causing high uncertainty in the model response. In this context, before collecting additional data, it is recommended first to recognize what improvement in model performance would occur if all available records could be well exploited. This work proposes a novel machine learning framework with physical constraints capable of successfully imputing a high percentage of missing data belonging to several environmental domains (meteorology, water quantity, water quality), yielding satisfactory results. In particular, the minimum NSE computed for meteorologic variables is 0.72. For hydrometric variables, NSE is always >0.97. More than 78 % of the physical-water-quality variables is characterized by NSE > 0.45, and >66 % of the chemical-water quality variables reaches NSE > 0.35. This work's results demonstrate the proposed framework's effectiveness as a data augmentation tool to improve the performance of integrated environmental modeling.
科研通智能强力驱动
Strongly Powered by AbleSci AI