Full text

Turn on search term navigation

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

The article proposes to use machine learning as one of the areas of artificial intelligence to forecast the volume of biogas production from household organic waste. The use of five regression algorithms (Linear Regression, Ridge Regression, Lasso Regression, Random Forest Regression, and Gradient Boosting Regression) to create an effective model for forecasting the volume of biogas production from household organic waste is considered. Based on the comparison of these algorithms by MSE and MAE indicators, the quality of training and their accuracy during forecasting are evaluated. The proposed algorithm for creating a model for forecasting biogas production volumes from household organic waste involves the implementation of 10 main and 3 auxiliary steps. Their advantage is that they aid in the performance of component data analysis, which is carried out based on the method of reducing the dimensionality of the data set, increasing interpretability, and minimizing the risk of data loss. An analysis of 2433 data is was carried out, which characterizes the formation of biogas from food (FW) and yard waste (YW) according to four features. Data preparation is performed using the Jupyter Notebook environment in Python. We select five machine learning algorithms to substantiate an effective model for forecasting volumes of biogas production from household organic waste. On the basis of the conducted research, the main advantages and disadvantages of the used algorithms for building forecasting models of biogas production volumes from household organic waste are determined. It is found that two models, “Random Forest Regressor” and “Gradient Boosting Regressor”, show the best accuracy indicators. The other three models (Linear Regression, Ridge Regression, Lasso Regression) are inferior in accuracy and were not considered further. To determine the accuracy of the “Random Forest Regressor” and “Gradient Boosting Regressor” models, we choose the MSE and MAE indicators. The Random Forest Regressor model is found to be a more accurate model compared to the Gradient Boosting Regressor. This is confirmed by the fact that the MSE of the “Random Forest Regressor” model on the training data set is 7.14 times smaller than that of the “Gradient Boosting Regressor” model. At the same time, MAE is 2.67 times smaller in the “Random Forest Regressor” model than in the “Gradient Boosting Regressor” model. The MSE and MAE of both models are worse on the test data set, which indicates overtraining tendencies. The Gradient Boosting Regressor model has worse MSE and MAE than the Random Forest Regressor model on both the training and test data sets. It is established that the model based on the “Random Forest Regressor” algorithm is the most effective for forecasting the volume of biogas production from household organic waste. It provides MAE = 0.088 on test data and the smallest absolute errors in predictions. Further systematic improvement of the “Random Forest Regressor” model for forecasting biogas production volumes from household organic waste based on new data will ensure its accuracy and maintain competitive advantages.

Details

Title
Prediction of Biogas Production Volumes from Household Organic Waste Based on Machine Learning
Author
Tryhuba, Inna 1 ; Tryhuba, Anatoliy 2   VIAFID ORCID Logo  ; Hutsol, Taras 3   VIAFID ORCID Logo  ; Cieszewska, Agata 4   VIAFID ORCID Logo  ; Andrushkiv, Oleh 5 ; Glowacki, Szymon 6   VIAFID ORCID Logo  ; Bryś, Andrzej 6   VIAFID ORCID Logo  ; Slobodian, Sergii 7 ; Tulej, Weronika 6   VIAFID ORCID Logo  ; Sojak, Mariusz 6   VIAFID ORCID Logo 

 Department of Information Technologies, Lviv National Environmental University, 80-381 Dublyany, Ukraine[email protected] (A.T.) 
 Department of Information Technologies, Lviv National Environmental University, 80-381 Dublyany, Ukraine[email protected] (A.T.); Ukrainian University in Europe—Foundation, Balicka 116, 30-149 Krakow, Poland 
 Department of Mechanics and Agroecosystems Engineering, Polissia National University, 10-008 Zhytomyr, Ukraine 
 Department of Landscape Architecture, Warsaw University of Life Sciences, Nowoursynowska 159, 02-787 Warsaw, Poland; [email protected] 
 Department of Information Technologies, Lviv State University of Life Safety, 79-007 Lviv, Ukraine; [email protected] 
 Department of Fundamentals of Engineering and Power Engineering, Institute of Mechanical Engineering, Warsaw University of Life Sciences (SGGW), 02-787 Warsaw, Poland[email protected] (W.T.); [email protected] (M.S.) 
 Department of Information Technology, Physical, Mathematical and Civil Defence Disciplines, Faculty of Energy and Information Technologies, Higher Educational Institution “Podillia State University”, 32-300 Kamianets-Podilskyi, Ukraine; [email protected] 
First page
1786
Publication year
2024
Publication date
2024
Publisher
MDPI AG
e-ISSN
19961073
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3037554726
Copyright
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.