About the Authors:
Enes Gul
Roles Conceptualization, Data curation, Formal analysis, Methodology, Software, Validation, Visualization, Writing – original draft, Writing – review & editing
Affiliation: Department of Civil Engineering, Inonu University, Malatya, Turkey
Mir Jafar Sadegh Safari
Roles Conceptualization, Data curation, Formal analysis, Project administration, Resources, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing
Affiliation: Department of Civil Engineering, Yaşar University, Izmir, Turkey
ORCID logo https://orcid.org/0000-0003-0559-5261
Ali Torabi Haghighi
Roles Conceptualization, Funding acquisition, Project administration, Supervision, Writing – review & editing
* E-mail: [email protected]
Affiliation: Water, Energy and Environmental Engineering Research Unit, University of Oulu, Oulu, Finland
ORCID logo https://orcid.org/0000-0002-5157-0156
Ali Danandeh Mehr
Roles Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Software, Supervision, Validation, Visualization, Writing – original draft
Affiliations Water, Energy and Environmental Engineering Research Unit, University of Oulu, Oulu, Finland, Department of Civil Engineering, Antalya Bilim University, Antalya, Turkey
Abstract
To reduce the problem of sedimentation in open channels, calculating flow velocity is critical. Undesirable operating costs arise due to sedimentation problems. To overcome these problems, the development of machine learning based models may provide reliable results. Recently, numerous studies have been conducted to model sediment transport in non-deposition condition however, the main deficiency of the existing studies is utilization of a limited range of data in model development. To tackle this drawback, six data sets with wide ranges of pipe size, volumetric sediment concentration, channel bed slope, sediment size and flow depth are used for the model development in this study. Moreover, two tree-based algorithms, namely M5 rule tree (M5RT) and M5 regression tree (M5RGT) are implemented, and results are compared to the traditional regression equations available in the literature. The results show that machine learning approaches outperform traditional regression models. The tree-based algorithms, M5RT and M5RGT, provided satisfactory results in contrast to their regression-based alternatives with RMSE = 1.184 and RMSE = 1.071, respectively. In order to recommend a practical solution, the tree structure algorithms are supplied to compute sediment transport in an open channel flow.
Figures
Fig 2
Fig 3
Table 1
Table 2
Table 3
Fig 1
Table 4
Table 5
Fig 2
Fig 3
Table 1
Table 2
Table 3
Citation: Gul E, Safari MJS, Torabi Haghighi A, Danandeh Mehr A (2021) Sediment transport modeling in non-deposition with clean bed condition using different tree-based algorithms. PLoS ONE 16(10): e0258125. https://doi.org/10.1371/journal.pone.0258125
Editor: Zaher Mundher Yaseen, Ton Duc Thang University, VIET NAM
Received: June 25, 2021; Accepted: September 20, 2021; Published: October 8, 2021
Copyright: © 2021 Gul et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All data used in the present study are available from the following DOIs. https://doi.org/10.1061/(ASCE)PS.1949-1204.0000335 https://doi.org/10.1080/00221686.2020.1780501 https://doi.org/10.2166/wst.2020.154.
Funding: The author(s) received no specific funding for this work.
Competing interests: The authors have declared that no competing interests exist.
1. Introduction
For the hydraulic design of urban drainage systems, the sediment transport process must be considered. Flow, fluid, sediment, and channel characteristics related to the sedimentation issues should be considered in order to design wastewater and sewage pipes [1, 2]. Channels are designed to minimize the deposition of sediment depending on different success criteria. Self-cleansing is a concept used in channel construction to encompass non-deposition requirements [3]. Non-deposition involves three criteria: incipient deposition (ID), non-deposition with a clean bed (NCB) and non-deposition with a deposited bed (NDB). The NCB criterion can be implemented by adapting the flow shear stress or velocity to satisfy the clean bed condition [4, 5]. ID is the moment when suspended particles begin to settle. In addition, in NDB criterion, an appropriate deposited bed depth is used to decrease the channel building costs [6, 7].
The non-deposition sediment transport equations were recommended to ensure the clean bed criterion by adopting the required velocity or sediment concentration, as shown in Table 1. The study conducted by [8] documented how the size of the granular material has a substantial effect on the transportation volume of sediment. The non-deposition condition in suspended sediment transport has already been examined by [9–11]. Comprehensive experiments on bed load sediment transport showed that the design velocity is increased as the pipe dimension is expanded [12]. The utility of previously recommended sewer design formulas was evaluated by [13–15]. [16] examined the Camp approach to sewer design and showed that the flow velocity was greatly overestimated. Experiments were carried out in a large pipe, and new self-cleansing models were recommended by [17]. [2] utilized [18] experimental data conducted in five different cross-section channels of rectangular, trapezoidal, V-bottom and U-shape and developed a self-cleansing model considering channel cross-section shape.
[Figure omitted. See PDF.]
Table 1. Traditional regression equations in literature for the sediment transport self-cleansing condition.
https://doi.org/10.1371/journal.pone.0258125.t001
Owing to the robustness of machine learning algorithms, their application on sediment transport modeling has attracted great interest in the literature [3, 19–22]. Various machine learning algorithms have recently been suggested for modeling open channel sediment transport over conventional regression equations. However, utilizing a limited data range for model development is the main limitation of previous studies. For example, different algorithms were applied to model sediment transport in an open channel, such as classification-based [23–25], tree-based [26], network-based [27–29] and evolutionary algorithms [30–33].
It is known that experimental data range, implemented machine learning algorithm and using effective parameters for model development based on the physics of the problem, are quite essential factors to construct a robust machine learning model. As an extension of the existing studies in the literature, in order to promote the modeling of sediment transport in non-deposition with clean bed condition of sediment transport all aforesaid factors are considered in this study. The improved M5 rule tree (M5RT) and M5 regression tree (M5RGT) have been used for solving numerous engineering problems [35, 36]. M5RT and M5RGT are useful for generating a rules-based and fingerprint models from a data set. Thus, in this study, tree-based algorithms of improved M5RT and M5RGT are used to model sediment transport in open channels. While majority of studies in the literature utilized limited amount of data for modeling, this study utilized six data sets having wide ranges of channel size and shape, sediment median size and volumetric concentration, channel bed slope and flow depth. Relying on the hydraulics of sediment transport, models are developed through considering fluid, flow, channel, and sediment characteristics.
2. Methodology
2.1. Data preparation
Experimental studies reported for NCB condition taken from the literature are used in this study. The large data set used in this study was compiled from [12, 16–18, 37, 38]. [37] performed experiments in two cross-sectional shapes, circular and rectangular. The tests were conducted with six different sizes of granular materials in a range between 0.5 to 8.74 mm. In the experiments [38] conducted tests in a circular pipe using granular material with a size of 0.73 mm. [12] conducted experiments with circular cross-sectional shapes, with granular material sizes ranging from 0.46–8.3 mm utilized in the experiments. [16] performed experiments in circular cross-sectional shape with two channels and three different granular materials with sizes between 0.2–0.43 mm. For more detail on this experimental data, the interested reader may refer to [29, 39].
As a novel contribution, the data of [17 and 18] are included in this study. The data used in these two studies enhances the modeling reliability. Thus, the use of data from [18] makes this study more reliable in obtaining experimental data with a variety of cross-sectional shapes, with experiments performed in trapezoidal, circular, U-shape, rectangular and V-shape bottom channels. Also [17], who used a large-diameter (595 mm) circular cross-sectional channel, makes this study more reliable than the models in the literature. The ranges of data sources are given in Table 2.
[Figure omitted. See PDF.]
Table 2. Range of data used in the present study.
https://doi.org/10.1371/journal.pone.0258125.t002
It is demonstrated by [1, 2] that four characteristics of fluid, flow, channel, and sediment must be embedded to a sediment transport model. As already reported in the literature, self-cleansing models can be influenced by the following parameters;(6)where V is flow velocity, g is gravity acceleration, ρ is fluid specific mass, υ is fluid kinematic viscosity, R is hydraulic radius, d is median size of sediment, λ is channel friction factor, Cv is sediment volumetric concentration, and ρs is sediment specific mass. These parameters can be considered effective sediment transport variables for the modeling. The following equation is written taking the above variables into consideration as a group of dimensionless parameters:(7)where Dgr and Frp are the dimensionless grain size and particle Froude number parameter, respectively defined as:(8)(9)where s is relative particle mass (s = ρs/ρ). Table 3 shows statistical characteristics of the utilized data. In this study, the data set was divided into 70% training and 30% testing data. Based on the findings published by [40], who performed uncertainty analysis, the split between training and test data does not make a major difference to model performance; however, the best data split rate was reported as 70% and 30% for training and testing periods.
[Figure omitted. See PDF.]
Table 3. Statistical characteristics of the data used for modeling.
https://doi.org/10.1371/journal.pone.0258125.t003
2.2. M5P classifiers
2.2.1. M5 regression tree.
The M5 Model Trees are a state-of-the-art algorithm that effectively divides the sample area into subspaces and forms linear regression models in pieces for each subspace [41]. Model trees are a more general form of regression tree, with constant values as their leaves [42]. M5’, also known as M5P, is an improved version of the Classification and Regression Tree (CART). A task was done to reduce the number of trees in the original M5P algorithm. It differs from other tree-based solutions due to the use of linear functions in the leaves. The linear function used at the decision tree nodes divides the tree into leaves to form the model tree. Tree-based models are constructed using a divide and conquer method.
The model tree is created in three stages. The first stage involves branching according to the splitting criterion. The branching criterion depends on the value of the standard deviation of the attribute value. The attribute that reduces the expected error is chosen as the root of the tree. The formula for standard deviation reduction (SDR) is calculated as in Eqs (10–12).(10)(11)(12)where n is the number of training examples at the node, T is a set of attributes that reaches the node, is the average value of the sets of T attribute and sd(T) is the standard deviation of T.
The second stage is tree pruning. Each leafless node of the model tree is examined, starting from below, for the pruning stage, as in Eq (13). The M5 algorithm selects the final model for this node of simplified linear model or model subtree, depending on the minimized estimated error rate. The final stage is tree smoothing.(13)where n is the number of training examples at the node and v is the number of parameters representing class value at the node.
The difference between real class value and predicted value is averaged in every training example to reach the node for the pruning process in the M5 algorithm. This average value is multiplied by this coefficient. The M5 Regression tree operates in the same way as M5P model tree in all steps. The value of subspaces that act as dividers is not a linear equation in the RGT model (Fig 1).
[Figure omitted. See PDF.]
Fig 1. Schematic for the M5P method.
https://doi.org/10.1371/journal.pone.0258125.g001
2.2.2. M5 rule tree.
M5 rule generates the rules from the M5 tree, based on the CART algorithm that was presented by [43]. The method for generating rules from model trees, which we call M5’ Rules, works as follows: a tree learner (in this case model trees) is applied to the whole training data and a pruned tree is developed. Next, the best leaf is made into a rule. All samples covered by the rule are removed from the dataset. The process is applied recursively to the remaining samples and terminates when all samples are covered by one or more rules. In contrast to CART, which employs the same strategy for categorical prediction, M5’ Rules builds full trees, instead of partially explored trees. All modeling was done in a Matlab 2016B environment [44]. Table 4 shows model parameters for M5RGT and M5RT.
[Figure omitted. See PDF.]
Table 4. Model parameters for M5RGT and M5RT.
https://doi.org/10.1371/journal.pone.0258125.t004
2.3. Performance criteria
The analysis of model performance is important for investigating the credibility of the models. Thus, the coefficient of determination (R2), variance account for (VAF), root mean square error (RMSE), Nash–Sutcliffe model efficiency coefficient (NASH), and a10-index are used in this study to determine the degree of fit index. The model performs well with R2, NASH and a10-index close to the unity, VAF close to 100 and RMSE close to zero. The R2, VAF, RMSE, NASH and a10-index can be computed using the following equations, respectively:(14)(15)(16)(17)(18)where, pj is the observed value, is the predicted value, subscript of t indicates the mean value, m is the data number, K is the total number in the dataset, and k10 is the number of samples in the case with a ratio of measured values to predicted values between 10% error (0.9 < observed/predicted < 1.1). Each performance criterion examines specific feature of the developed model as R2 shows the correlation, RMSE scattering rates, VAF variance variation, NASH data skewness and a10-index gives information about model reliability.
3. Results
The performances of two tree-based algorithms, M5RT and M5RGT, are compared by means of the different statistical error measurement criteria of R2, VAF, RMSE, NASH and a10-index in Table 5. Performance of M5RT and M5RGT models recommended in this study are examined in comparison to four empirical equations of [2, 12, 16, 17, 34]. [12 and 16] equations are selected due to their credibility as reported in [39]. Furthermore, recently reported equations of [2, 17, 34] are used for model performance evaluation. The results showed that M5RGT is slightly superior to M5RT, in terms of different statistical performance criteria. Both models proposed in this study performed better than traditional equations. Among regression models, [16] generates the poorest results, while [2] provide better results in contrast to other regression equations. A comparison of best machine leaning models of M5RGT with RMSE of 1.071 with the best regression equation of [2] with RMSE of 1.350 illustrates an almost 21% improvement on the accuracy of the model in M5RGT algorithm.
[Figure omitted. See PDF.]
Table 5. Model performances with different statistical indexes.
https://doi.org/10.1371/journal.pone.0258125.t005
Over-fitting and under-fitting are of great importance in determining the accuracy of the model. Over-fitting means the model memorizes while training. Therefore, test performance is significantly worse than training performance. On the other hand, in the case of under-fitting, test results are better than training results. The best-case scenario is the model performance in the training and testing phases being close to each other. Table 5 shows how the results of four different models in the training and testing phases have no significant differences, so it can be said that the developed models in this study have no such deficiencies.
Fig 2 shows the scatterplot of the observed and predicted values for different models during the training phase. These are two major problems for the underestimation or overestimation of sediment transport models. If the predicted value is greater than the observed value, an overestimation problem occurs; otherwise, if predicted values are lower than the actual counterparts, underestimation will occur. An overestimated model could not be an economical design method as it causes open channels to be designed larger than required. An underestimated model, on the other hand, causes the channel to be designed without sufficient planning criteria. As shown in Fig 2, bisector lines for both models are passed through the middle of the data clouds showing that developed models in this study have no significant over- or underestimation.
[Figure omitted. See PDF.]
Fig 2. Scatter plots of observed and predicted particle Froude numbers for the training stage.
https://doi.org/10.1371/journal.pone.0258125.g002
The visually comparison of M5RT and M5RGT with conventional equations in the form of scatter plots are shown in Fig 3. It can be seen from Fig 3 that M5RT and M5RGT outcomes are close to the best fit line, showing their accuracy where, except a few data points, most of the data falls within the ±10% lines. In the case of regression equations, a significant scatter has been seen, showing their deficiency in accurate sediment transport prediction. The equations of [12, 16, 17] generate considerable overestimation, while the equation of [2] shows a slight underestimation.
[Figure omitted. See PDF.]
Fig 3. Scatter plots for observed and predicted particle Froude numbers for the testing stage.
https://doi.org/10.1371/journal.pone.0258125.g003
4. Discussions
As noted earlier, machine learning models for sediment transport are superior to traditional regression equations. While traditional regression equations are more practical than machine learning methods, they have not produced reliable models. For this reason, model trees methods with practical application areas were proposed as an alternative to classical regression equations in this study (their codes are given as S1 Table).
The data range is quite important for the reliability of the model for sediment transport in open channels. In this study, a more comprehensive data set than the data used in studies found in the literature is used for the first time. The wide range of data makes it difficult to establish an accurate and reliable model. The methods (M5RT and M5RGT) used in this study performed better than the methods recommended in the literature, although the utilized data has a considerable wide range. This study examines the performance of traditional equations when using this large data set. According to the traditional regression equations’ performances, the [2] equation was found to have the best performance in contrast to the other equations. Modeling such a large data set can reveal the problem of overestimation and underestimation. In particular, it was observed that the traditional regression equations overestimated, except for the [2] equation, which has a slight underestimation. Additionally, it was found that traditional equations are overestimated models for analyzing sediment transport in self-cleansing design using limited data sets [29, 45].
In this study, a new index, a10-index, is also used. The a10-index shows the amount of data between the 10th percentiles. Examining the a10-index performances of the models, together with the RMSE, enabled more reliable models to be obtained. Examining such a large data set is important in terms of observing the outlier effect. M5RT was superior to M5RGT in terms of a10-index performance criterion. As can be understood from the results obtained in this study, the outlier performance is found to be better in M5RGT model.
It must be noticed that over-fitting is an essential issue in application of machine learning algorithms [29, 46]. Incorporating a few number of input variables and weights to construct a well-organized model works out over-fitting issue [47]. An over-fitted model fits on train data set perfectly and provides weak results on test data set [48]. The recommended models in this study for sediment transport modeling are not expected to encounter with over-fitting issue as developed models perform well on test data set. Most importantly, the developed models in this study are constructed on large data sets and model parameters are elected relying on the physics of the sediment transport in open channel flow.
The results obtained in this study illustrate that the recommended M5RT and M5RGT models give better outcomes in comparison to the empirical equations of [2, 12, 16, 17]. Implementation of robust machine leaning algorithms for solving complex and difficult hydraulic problems such as sediment transport is inevitable. However, considering effective variables based on the physics of the problem, and experimental data range are of prominent importance to get reliable results. While most of the studies in the literature used a few data sets for modeling sediment transport at non-deposition with clean bed condition, this study extends the available studies in the literature through utilizing wider range of experimental data taken from six sources which cover wide ranges of channel size and shape, sediment median size and concentration and flow depth. As a result, it seems that this study favorably developed sediment transport models which can be used in practice for channel design purpose.
In this study, the laboratory data collected from the literature was used for sediment transport modeling. The authors acknowledge that more reliable modeling could be done using the field data obtained in practice. Although tree-based algorithms produce solutions that can be used in practice, using alternative methods, such as symbolic regression or bagged tree, is recommended in future studies. In addition, a detailed examination of the outlier effect will make important contributions to future studies.
5. Conclusions
In this study, modeling of the sediment transport in open channels is conducted using two different tree-based algorithms, M5 rule tree (M5RT) and M5 regression tree (M5RGT). The six existing data sets with a wide range compiled from the literature are used for model development. The proposed algorithms, M5RT and M5RGT, are compared to traditional regression equations, in particle Froude number prediction. Our results indicated that the M5RGT outperforms M5RT with RMSE = 1.071 and RMSE = 1.184, respectively. Modeling results show that the proposed algorithms are superior to the traditional equations. According to traditional regression equations’ performances, most of the models show significant overestimation, demonstrating their deficiency in terms of economical design benchmarks, where channels need steeper bed slope to satisfy the non-deposition sediment transport condition. In addition, a new index, a10-index, is presented in this study to enhance the model performance examination. The tree structures are presented explicitly and are expected to provide practical solutions for users. Future research directions can be considered based on the limitation of this study in terms of utilized data for the modeling. Conducting filed studies to collect data from real sewers and drainage systems through incorporating cohesive sediment particle characteristics may improve the credibility of the developed models.
Supporting information
S1 Table. M5 regression tree (M5RGT) structure.
https://doi.org/10.1371/journal.pone.0258125.s001
(DOCX)
Citation: Gul E, Safari MJS, Torabi Haghighi A, Danandeh Mehr A (2021) Sediment transport modeling in non-deposition with clean bed condition using different tree-based algorithms. PLoS ONE 16(10): e0258125. https://doi.org/10.1371/journal.pone.0258125
1. Robinson M.P. Jr, Graf W.H., 1972. Pipelining of Low-Concentration Sand-Water Mixtures. Journal of the Hydraulics Division, 98(7): 1221–1241.
2. Safari M.J.S., Aksoy H., 2021. Experimental analysis for self-cleansing open channel design. Journal of Hydraulic Research, 59(3): 500–511.
3. Safari M. J. S., Danandeh Mehr A., 2018. Multigene genetic programming for sediment transport modeling in sewers for conditions of non-deposition with a bed deposit. International Journal of Sediment Research, 33(3), 262–270.
4. Ota J.J., Nalluri C., 2003. Urban storm sewer design: Approach in consideration of sediments. Journal of Hydraulic Engineering, 129(4): 291–297.
5. Safari M.-J.-S., Aksoy H., Unal N.E., Mohammadi M., 2017. Non-deposition self-cleansing design criteria for drainage systems. Journal of Hydro-environment Research, 14: 76–84.
6. Nalluri C., El-Zaemey A.K., Chan H.L., 1997. Sediment transport over fixed deposited beds in sewers—An appraisal of existing models. Water Science and Technology, 36(8–9): 123–128.
7. Safari M.J.S., Shirzad A., 2019. Self‐cleansing design of sewers: Definition of the optimum deposited bed thickness. Water Environment Research, 91(5): 407–416. pmid:30731037
8. Pedroli R., 1963. Bed load transportation in channels with fixed and smooth inverts. Mitteillung des Eidg. Amtes fur Wasserwirtschaft, Dienst Exemplar(43).
9. Arora, A., 1983. Velocity distribution and sediment transport in rigid bed open channels, Ph. D. Thesis, University of Rookee, Rookee.
10. Macke E., 1982. About sedimentation at low concentrations in partly filled pipes. Mitteilungen, LeichtweissÐInstitut fuÈ r Wasserbau der Technischen, UniversitaÈ t Braunschweig.
11. Nalluri C., Spaliviero F., 1998. Suspended sediment transport in rigid boundary channels at limit deposition. Water Science and Technology, 37(1): 147–154.
12. Ghani Ab, 1993. Sediment transport in sewers, Newcastle University.
13. May R.W., Ackers J.C., Butler D., Siân J., 1996. Development of design methodology for self-cleansing sewers. Water Science and Technology, 33(9): 195.
14. De Sutter R. et al., 2003. Validation of existing bed load transport formulas using in-sewer sediment. Journal of Hydraulic Engineering, 129(4): 325–333.
15. Butler D., May R., Ackers J., 2003. Self-cleansing sewer design based on sediment transport principles. Journal of Hydraulic Engineering, 129(4): 276–282.
16. Vongvisessomjai N., Tingsanchali T., Babel M.S., 2010. Non-deposition design criteria for sewers with part-full flow. Urban Water Journal, 7(1): 61–77.
17. Montes C., Vanegas S., Kapelan Z., Berardi L., Saldarriaga J., 2020. Non-deposition self-cleansing models for large sewer pipes. Water Science and Technology, 81(3): 606–621. pmid:32385214
18. Safari, M., 2016. Self-cleansing drainage system design by incipient motion and incipient deposition-based models, PhD Thesis, Istanbul Technical University, Turkey.
19. Alizadeh M.J., Nodoushan E.J., Kalarestaghi N., Chau K.W., 2017. Toward multi-day-ahead forecasting of suspended sediment concentration using ensemble models. Environmental Science and Pollution Research, 24(36): 28017–28025. pmid:28993996
20. Chen X.-Y., Chau K.-W., 2019. Uncertainty analysis on hybrid double feedforward neural network model for sediment load estimation with LUBE method. Water Resources Management, 33(10): 3563–3577.
21. Reisenbüchler M., Bui M.D., Rutschmann P., 2021. Reservoir Sediment Management Using Artificial Neural Networks: A Case Study of the Lower Section of the Alpine Saalach River. Water, 13(6): 818.
22. Sharafati A., Haji Seyed Asadollah S.B., Motta D., Yaseen Z.M., 2020. Application of newly developed ensemble machine learning models for daily suspended sediment load prediction and related uncertainty analysis. Hydrological Sciences Journal, 65(12): 2022–2042.
23. Ebtehaj I., Bonakdari H., Shamshirband S., Ismail Z., Hashim R., 2016a. New approach to estimate velocity at limit of deposition in storm sewers using vector machine coupled with firefly algorithm. Journal of Pipeline Systems Engineering and Practice, 8(2): 04016018.
24. Ebtehaj I., Bonakdari H., Shamshirband S., Mohammadi K., 2016b. A combined support vector machine-wavelet transform model for prediction of sediment transport in sewer. Flow Measurement and Instrumentation, 47: 19–27.
25. Roushangar K., Ghasempour R., 2016. Prediction of non-cohesive sediment transport in circular channels in deposition and limit of deposition states using SVM. Water Science and Technology: Water Supply, 17(2): 537–551.
26. Ebtehaj I., Bonakdari H., Zaji A.H., 2016c. An expert system with radial basis function neural network based on decision trees for predicting sediment transport in sewers. Water Science and Technology, 74(1): 176–183. pmid:27386995
27. Azamathulla H.M., Ghani A.A., Fei S.Y., 2012. ANFIS-based approach for predicting sediment transport in clean sewer. Applied Soft Computing, 12(3): 1227–1230. pmid:22389640
28. Najafzadeh M., Bonakdari H., 2016. Application of a neuro-fuzzy GMDH model for predicting the velocity at limit of deposition in storm sewers. Journal of Pipeline Systems Engineering and Practice, 8(1): 06016003.
29. Safari M.J.S., Ebtehaj I., Bonakdari H., Es-haghi M.S., 2019. Sediment transport modeling in rigid boundary open channels using generalize structure of group method of data handling. J Hydrology, 577: 123951.
30. Ab Ghani A., Azamathulla H.M., 2010. Gene-expression programming for sediment transport in sewer pipe systems. Journal of Pipeline Systems Engineering and Practice, 2(3): 102–106.
31. Ebtehaj I., Bonakdari H., 2016. Assessment of evolutionary algorithms in predicting non-deposition sediment transport. Urban Water Journal, 13(5): 499–510.
32. Najafzadeh M., Laucelli D.B., Zahiri A., 2017. Application of model tree and evolutionary polynomial regression for evaluation of sediment transport in pipes. KSCE Journal of Civil Engineering, 21(5): 1956–1963.
33. Roushangar K., Ghasempour R., 2017. Estimation of bedload discharge in sewer pipes with different boundary conditions using an evolutionary algorithm. International Journal of Sediment Research, 32(4): 564–574.
34. Tafarojnoruz A., Sharafati A., 2020. New formulations for prediction of velocity at limit of deposition in storm sewers based on a stochastic technique. Water Science and Technology, 81(12): 2634–2649. pmid:32857749
35. Ayaz Y., Kocamaz A.F., Karakoç M.B., 2015. Modeling of compressive strength and UPV of high-volume mineral-admixtured concrete using rule-based M5 rule and tree model M5P classifiers. Const Build Mat, 94: 235–240.
36. Fang Q., Nguyen H., Bui X.-N., Nguyen-Thoi T., 2020. Prediction of blast-induced ground vibration in open-pit mines using a new technique based on imperialist competitive algorithm and M5Rules. Natural Resources Research, 29(2): 791–806.
37. Mayerle R., 1988. Sediment transport in rigid boundary channels, University of Newcastle upon Tyne.
38. May R.W., 1993. Sediment transport in pipes, sewers and deposited beds.
39. Safari M.J.S., Mohammadi M., Ab Ghani A., 2018. Experimental studies of self-cleansing drainage system design: a review. Journal of Pipeline Systems Engineering and Practice, 9(4): 04018017.
40. Ebtehaj I. et al., 2020. Combination of sensitivity and uncertainty analyses for sediment transport modeling in sewer pipes. International Journal of Sediment Research, 35(2): 157–170.
41. Quinlan J.R., 1992. Learning with continuous classes, 5th Australian joint conference on artificial intelligence. World Scientific, pp. 343–348.
42. Wang Y., Witten I.H., 1996. Induction of model trees for predicting continuous classes.
43. Frank E., Witten I.H., 1998. Generating accurate rule sets without global optimization.
44. Jekabsons, G., 2016. M5’regression tree, model tree, and tree ensemble toolbox for Matlab/Octave ver. 1.7. 0.
45. Kargar K., Safari M.J.S., Khosravi K., 2021. Weighted instances handler wrapper and rotation forest-based hybrid algorithms for sediment transport modeling. J Hydrology: 126452.
46. Fathian F., Mehdizadeh S., Sales A.K., Safari M.J.S., 2019. Hybrid models to improve the monthly river flow prediction: Integrating artificial intelligence and non-linear time series models. J Hydrology, 575: 1200–1213.
47. Chang F.-J., Kao L. s., Kuo Y.-M., Liu C.-W., 2010. Artificial neural networks for estimating regional arsenic concentrations in a blackfoot disease area in Taiwan. J Hydrology, 388(1–2): 65–76.
48. Bishop C.M., 1995. Neural networks for pattern recognition. Oxford university press.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2021 Gul et al. This is an open access article distributed under the terms of the Creative Commons Attribution License: http://creativecommons.org/licenses/by/4.0/ (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
To reduce the problem of sedimentation in open channels, calculating flow velocity is critical. Undesirable operating costs arise due to sedimentation problems. To overcome these problems, the development of machine learning based models may provide reliable results. Recently, numerous studies have been conducted to model sediment transport in non-deposition condition however, the main deficiency of the existing studies is utilization of a limited range of data in model development. To tackle this drawback, six data sets with wide ranges of pipe size, volumetric sediment concentration, channel bed slope, sediment size and flow depth are used for the model development in this study. Moreover, two tree-based algorithms, namely M5 rule tree (M5RT) and M5 regression tree (M5RGT) are implemented, and results are compared to the traditional regression equations available in the literature. The results show that machine learning approaches outperform traditional regression models. The tree-based algorithms, M5RT and M5RGT, provided satisfactory results in contrast to their regression-based alternatives with RMSE = 1.184 and RMSE = 1.071, respectively. In order to recommend a practical solution, the tree structure algorithms are supplied to compute sediment transport in an open channel flow.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer