It appears you don't have support to open PDFs in this web browser. To view this file, Open with your PDF reader
Abstract
Harmful algal blooms are becoming increasingly prevalent due to climate warming and eutrophication. Leveraging machine learning tools to forecast algal blooms is crucial and promising for bloom management in various water systems. Notably, previous findings are site-specific, especially regarding the impacts of forecasting periods and important input features. However, there is a significant research gap in the application of machine learning for predicting algal blooms in the Great Lakes, the world’s largest freshwater system. Thus, based on the measurements of 16 water quality parameters from 2012 to 2022, the author established the extreme gradient boosting (XGBoost) model to forecast chlorophyll a (Chl a, a proxy for algal biomass) concentration for 1–7 d in Lake Erie. The XGBoost model performance is quite satisfactory, with the lowest MSE of 10.94 and the highest R2 of 0.99 for the 1 d forecast and an MSE of 83.90 and an R2 of 0.90 for the 7 d forecast. Once trained, the model takes only a few seconds to run on an Intel Core i7 personal laptop. Based on Shapley additive explanations (SHAP) feature importance, water depth (Depth) and water temperature (Temp), are more important input features for the 7 d forecasting model than the well-recognized phosphorus and nitrogen nutrients, including particulate organic nitrogen (PON), soluble reactive phosphorous (SRP), nitrate + nitrite (NN), and total phosphorous (TP). Achieving relatively high accuracy for the 7 d forecast, with an R2 of 0.83 and an MSE of 144.40, is possible by using only the top six most important input features: initial Chl a, Depth, Temp, PON, SRP, and NN, based on SHAP feature importance results. These findings highlight the accuracy and efficiency of the developed XGBoost model to predict Chl a in the world’s largest freshwater system. The model can enhance algal bloom monitoring efficiency through early detection and key predictive features, supporting an early warning system for timely interventions, while also informing policy decisions and optimizing resource allocation.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer