Inicio  /  Water  /  Vol: 12 Par: 4 (2020)  /  Artículo
ARTÍCULO
TITULO

Machine Learning Approaches for Predicting Health Risk of Cyanobacterial Blooms in Northern European Lakes

Nikolaos Mellios    
S. Jannicke Moe and Chrysi Laspidou    

Resumen

Cyanobacterial blooms are considered a major threat to global water security with documented impacts on lake ecosystems and public health. Given that cyanobacteria possess highly adaptive traits that favor them to prevail under different and often complicated stressor regimes, predicting their abundance is challenging. A dataset from 822 Northern European lakes is used to determine which variables better explain the variation of cyanobacteria biomass (CBB) by means of stepwise multiple linear regression. Chlorophyll-a (Chl-a) and total nitrogen (TN) provided the best modelling structure for the entire dataset, while for subsets of shallow and deep lakes, Chl-a, mean depth, TN and TN/TP explained part of the variance in CBB. Path analysis was performed and corroborated these findings. Finally, CBB was translated to a categorical variable according to risk levels for human health associated with the use of lakes for recreational activities. Several machine learning methods, namely Decision Tree, K-Nearest Neighbors, Support-vector Machine and Random Forest, were applied showing a remarkable ability to predict the risk, while Random Forest parameters were tuned and optimized, achieving a 95.81% accuracy, exceeding the performance of all other machine learning methods tested. A confusion matrix analysis is performed for all machine learning methods, identifying the potential of each method to correctly predict CBB risk levels and assessing the extent of false alarms; random forest clearly outperforms the other methods with very promising results.

 Artículos similares

       
 
Zhenzhen Di, Miao Chang, Peikun Guo, Yang Li and Yin Chang    
Most worldwide industrial wastewater, including in China, is still directly discharged to aquatic environments without adequate treatment. Because of a lack of data and few methods, the relationships between pollutants discharged in wastewater and those ... ver más
Revista: Water

 
Ognjen Radovic,Srdan Marinkovic,Jelena Radojicic    
Credit scoring attracts special attention of financial institutions. In recent years, deep learning methods have been particularly interesting. In this paper, we compare the performance of ensemble deep learning methods based on decision trees with the b... ver más

 
Pablo de Llano, Carlos Piñeiro, Manuel Rodríguez     Pág. pp. 163 - 198
This paper offers a comparative analysis of the effectiveness of eight popular forecasting methods: univariate, linear, discriminate and logit regression; recursive partitioning, rough sets, artificial neural networks, and DEA. Our goals are: clarify the... ver más

 
Hugo López-Fernández     Pág. 22 - 25
Mass spectrometry using matrix assisted laser desorption ionization coupled to time of flight analyzers (MALDI-TOF MS) has become popular during the last decade due to its high speed, sensitivity and robustness for detecting proteins and peptides. This a... ver más

 
Rejath Jose, Faiz Syed, Anvin Thomas and Milan Toma    
The advancement of machine learning in healthcare offers significant potential for enhancing disease prediction and management. This study harnesses the PyCaret library?a Python-based machine learning toolkit?to construct and refine predictive models for... ver más
Revista: Applied Sciences