Inicio  /  Hydrology  /  Vol: 10 Par: 4 (2023)  /  Artículo
ARTÍCULO
TITULO

A Machine-Learning Framework for Modeling and Predicting Monthly Streamflow Time Series

Hatef Dastour and Quazi K. Hassan    

Resumen

Having a complete hydrological time series is crucial for water-resources management and modeling. However, this can pose a challenge in data-scarce environments where data gaps are widespread. In such situations, recurring data gaps can lead to unfavorable outcomes such as loss of critical information, ineffective model calibration, inaccurate timing of peak flows, and biased statistical analysis in various applications. Despite its importance, predicting monthly streamflow can be a complex task due to its connection to random dynamics and uncertain phenomena, posing significant challenges. This study introduces an ensemble machine-learning regression framework for modeling and predicting monthly streamflow time series with a high degree of accuracy. The framework utilizes historical data from multiple monthly streamflow datasets in the same region to predict missing monthly streamflow data. The framework selects the best features from all available gap-free monthly streamflow time-series combinations and identifies the optimal model from a pool of 12 machine-learning models, including random forest regression, gradient boosting regression, and extra trees regressor, among others. The model selection is based on cross-validation train-and-test set scores, as well as the coefficient of determination. We conducted modeling on 26 monthly streamflow time series and found that the gradient boosting regressor with bagging regressor produced the highest accuracy in 7 of the 26 instances. Across all instances, the models using this method exhibited an overall accuracy range of 0.9737 to 0.9968. Additionally, the use of either a bagging regressor or an AdaBoost regressor improved both the tree-based and gradient-based models, resulting in these methods accounting for nearly 80% of the best models. Between January 1960 and December 2021, an average of 40% of the monthly streamflow data was missing for each of the 26 stations. Notably, two crucial stations located in the economically significant lower Athabasca Basin River in Alberta province, Canada, had approximately 70% of their monthly streamflow data missing. To address this issue, we employed our framework to accurately extend the missing data for all 26 stations. These accurate extensions also allow for further analysis, including grouping stations with similar monthly streamflow behavior using Pearson correlation.

 Artículos similares

       
 
Alireza Hajiheidari, Mahmoud Reza Delavar and Abbas Rajabifard    
Enriching and updating maps are among the most important tasks of any urban management organization for informed decision making. Urban cadastral map enrichment is a time-consuming and costly process, which needs an expert?s opinion for quality control. ... ver más

 
Enrique González-Núñez, Luis A. Trejo and Michael Kampouridis    
This research aims at applying the Artificial Organic Network (AON), a nature-inspired, supervised, metaheuristic machine learning framework, to develop a new algorithm based on this machine learning class. The focus of the new algorithm is to model and ... ver más

 
Davy Preuveneers and Wouter Joosen    
Ontologies have the potential to play an important role in the cybersecurity landscape as they are able to provide a structured and standardized way to semantically represent and organize knowledge about a domain of interest. They help in unambiguously m... ver más
Revista: Future Internet

 
Timothy O. Hodson, Keith J. Doore, Terry A. Kenney, Thomas M. Over and Muluken B. Yeheyis    
Streamflow is one of the most important variables in hydrology, but it is difficult to measure continuously. As a result, nearly all streamflow time series are estimated from rating curves that define a mathematical relationship between streamflow and so... ver más
Revista: Hydrology

 
Benny Wijaya, Mengmeng Yang, Tuopu Wen, Kun Jiang, Yunlong Wang, Zheng Fu, Xuewei Tang, Dennis Octovan Sigomo, Jinyu Miao and Diange Yang    
This research paper employed a multi-session framework to present an innovative approach to map monitoring within the domain of high-definition (HD) maps. The proposed methodology uses a machine learning algorithm to derive a confidence level for the det... ver más