Inicio  /  Water  /  Vol: 15 Par: 3 (2023)  /  Artículo
ARTÍCULO
TITULO

A Machine Learning Approach to Predict Watershed Health Indices for Sediments and Nutrients at Ungauged Basins

Ganeshchandra Mallya    
Mohamed M. Hantush and Rao S. Govindaraju    

Resumen

Effective water quality management and reliable environmental modeling depend on the availability, size, and quality of water quality (WQ) data. Observed stream water quality data are usually sparse in both time and space. Reconstruction of water quality time series using surrogate variables such as streamflow have been used to evaluate risk metrics such as reliability, resilience, vulnerability, and watershed health (WH) but only at gauged locations. Estimating these indices for ungauged watersheds has not been attempted because of the high-dimensional nature of the potential predictor space. In this study, machine learning (ML) models, namely random forest regression, AdaBoost, gradient boosting machines, and Bayesian ridge regression (along with an ensemble model), were evaluated to predict watershed health and other risk metrics at ungauged hydrologic unit code 10 (HUC-10) basins using watershed attributes, long-term climate data, soil data, land use and land cover data, fertilizer sales data, and geographic information as predictor variables. These ML models were tested over the Upper Mississippi River Basin, the Ohio River Basin, and the Maumee River Basin for water quality constituents such as suspended sediment concentration, nitrogen, and phosphorus. Random forest, AdaBoost, and gradient boosting regressors typically showed a coefficient of determination R2>0.8" role="presentation" style="position: relative;">??2>0.8R2>0.8 R 2 > 0.8 for suspended sediment concentration and nitrogen during the testing stage, while the ensemble model exhibited R2>0.95" role="presentation" style="position: relative;">??2>0.95R2>0.95 R 2 > 0.95 . Watershed health values with respect to suspended sediments and nitrogen predicted by all ML models including the ensemble model were lower for areas with larger agricultural land use, moderate for areas with predominant urban land use, and higher for forested areas; the trained ML models adequately predicted WH in ungauged basins. However, low WH values (with respect to phosphorus) were predicted at some basins in the Upper Mississippi River Basin that had dominant forest land use. Results suggest that the proposed ML models provide robust estimates at ungauged locations when sufficient training data are available for a WQ constituent. ML models may be used as quick screening tools by decision makers and water quality monitoring agencies for identifying critical source areas or hotspots with respect to different water quality constituents, even for ungauged watersheds.

 Artículos similares

       
 
Zhenzhen Di, Miao Chang, Peikun Guo, Yang Li and Yin Chang    
Most worldwide industrial wastewater, including in China, is still directly discharged to aquatic environments without adequate treatment. Because of a lack of data and few methods, the relationships between pollutants discharged in wastewater and those ... ver más
Revista: Water

 
Ognjen Radovic,Srdan Marinkovic,Jelena Radojicic    
Credit scoring attracts special attention of financial institutions. In recent years, deep learning methods have been particularly interesting. In this paper, we compare the performance of ensemble deep learning methods based on decision trees with the b... ver más

 
Pablo de Llano, Carlos Piñeiro, Manuel Rodríguez     Pág. pp. 163 - 198
This paper offers a comparative analysis of the effectiveness of eight popular forecasting methods: univariate, linear, discriminate and logit regression; recursive partitioning, rough sets, artificial neural networks, and DEA. Our goals are: clarify the... ver más

 
Hugo López-Fernández     Pág. 22 - 25
Mass spectrometry using matrix assisted laser desorption ionization coupled to time of flight analyzers (MALDI-TOF MS) has become popular during the last decade due to its high speed, sensitivity and robustness for detecting proteins and peptides. This a... ver más

 
Rejath Jose, Faiz Syed, Anvin Thomas and Milan Toma    
The advancement of machine learning in healthcare offers significant potential for enhancing disease prediction and management. This study harnesses the PyCaret library?a Python-based machine learning toolkit?to construct and refine predictive models for... ver más
Revista: Applied Sciences