Content area
During the long-term operating period, the mechanical parameters of hydraulic structures and foundation deteriorated gradually because of the environmental factors. In order to evaluate the overall safety and durability, these parameters should be calculated by some accurate analysis methods, which are hindered by slow computational efficiency and optimization performance. The improved deep Q-network (DQN) algorithm combined with the deep neural network (DNN) surrogate model was proposed in this paper to ameliorate the above problems. Through the study cases of different zoning in the dam body and the actual engineering foundation, it is shown that the improved DQN algorithm has a good application effect on inversion analysis of material mechanical parameters in this paper.
This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
1. Introduction
The premier task is to monitor the safe status of structures during the operating period. There have been catastrophes of engineer crash from time to time around the world due to the lack of overall monitoring methods and the low analysis accuracy of calculating methods. A disastrous example is that the dam Edenville broke, and the leaking flood shattered both Smallwood dam and Sanford dam subsequently in the downstream position, which caused serious damage to surrounding cities.
The hydraulic project crashes happen mainly because of the collapse of the dam body and the sliding of the foundation or abutment. During the operating term, the concrete dam is affected by environmental factors obviously. At the microlevel, there are physical and chemical reactions in the parameters of the dam body material and foundation material, so their mechanical parameters deteriorated gradually, leading to the increase of structure displacement or leakage at the macrolevel. Both the deformation of the dam body or foundation and the leakage of the concrete structure are key monitoring targets. The deformation monitoring includes forward analysis and inversion analysis. The former is to map the linear or nonlinear relation between environmental loads and displacement by establishing a regress model [7–9], whose target is predicting the status of the engineering and environment nearby in the future. The latter is to check the strength and the stability according to the mechanical parameters of structures or foundations by calculating the data of structural operating state combined with the data of the environmental variation [10].
Because the constitutive models of practical engineering are all nonlinear, it is impossible to work out the problems directly. By calculating the maximum or minimum value of target functions, the heuristic algorithms became the main methods to optimize parameters in the feasible region. Particle swarm optimization (PSO) algorithm and genetic algorithm were applied to optimize the structural parameters in the early time [11]. Kang introduced the artificial bee algorithm in 2013 [12]. And he optimized the models by combining heuristic algorithm with machine learning algorithm in 2016 [13–15]. After that, he improved firework algorithm and obtained better effect in identifying parameters [16]. Besides, Lin carried out inversion calculation with wolf pack algorithm, and the resultant accuracy was higher than whale optimization algorithm.
There are two main problems existing in the inversion analysis of hydraulic structures. The first one is that the current displacement inversion method is based on the finite element method (FEM). Under the combination of different mechanical parameters and environmental loads, the nodes’ displacements are calculated by the finite element model. With the growing number of parameters, the calculating dimensions rise synchronously. Besides, the time complexity of the finite element model increases sharply with more grids. The two factors could lead to the result that the calculating convergence time is so long that the feasibility of application in the practical project is low. The second one is that although so many heuristic algorithms provide the possibility to implement global search in the feasible region, these methods calculate and compare the target values after sampling practical points in the parameter space, so they could not guarantee the best consequence in the multidimensional parameter space and have poor convergence in practice.
Recently, machine learning algorithm with a positive developing trend includes three parts, which are supervised learning, unsupervised learning, and reinforcement learning (RL) [17]. As the cutting-edge branch, RL differs from the other ones. It is a learning algorithm with delay effect, seeking the best policy with dynamic programming [17]. The core idea is that the agent tries different policies to select corresponding actions under diverse state from the environment during the interactive process between the agent and the environment, so the agent could find the best action to maximize the reward when facing different states after the learning stage [18]. RL adopts the way of exploring from the beginning time and then utilizing the exploratory experience to complete the trail-and-error process [19]. Bellman proposed a dynamic method to deal with the value function based on the information from the systematic state [20], but the curse of dimensionality occurred when the method was applied, which was solved effectively by Mes and Rivera [21]. Some scholars introduced the function approximation method to access the value when the state and action were consecutive, such as the linear function and artificial neural network [23, 24]. With the gradual development of RL theory, these relative technologies had made great progress in the industry. Zhiang Zhang et al. reduced the indoor energy consumption by 16.7% by optimizing the HVAC system with deep reinforced learning algorithm [25]. Zhe Wang and Hong discussed the contribution and current obstacles when RL was adopted in controlling buildings [26]. The industry of robot employed RL to control the mechanical action accurately [27–30]. Fangyuan Chang et al. achieved the goal of reducing cost in the charging battery by combining RL and LSTM [31].
To improve two inversion problems with machine learning mentioned above in the paper, the DNN surrogate model and reinforcement learning are introduced into the structural inversion calculation for the first time. The deep neural network completes the learning stage with training samples which are the calculating results from the FEM, which makes the DNN model replace the finite element model to map the target points’ displacements approximately and improve the convergence efficiency greatly under the premise of ensuring the calculating accuracy. The basic theory of reinforcement learning guarantees the convergence of the algorithm. The inversion calculation of structural material parameters with monitoring data is a Markov process. Its core is working out the best value of a nonlinear function in the global parameter space. Taking the monitoring data as a part of the observable environmental state, the inversion calculation and optimization of structural material parameters can be realized through reinforcement learning combined with the engineering’s deep learning surrogate model. This paper adopts the punitive idea which is a negative reinforcement mode to form deep reinforcement learning algorithm by combining the target of inversion calculation and the DNN surrogate model with reinforcement learning. Besides, the interactive mode of information between the agent and the environment is improved to adapt to the optimization of material parameters of engineering structures and the surrounding foundation. The last part is to employ a new mode to express the displacement relativity among different monitoring points from the same structural sections to make the deep reinforcement learning algorithm adapt to the inversion calculation of multiple zones, to ensure the coordination among the parameters among all zones in the same section, so that this algorithm could get a wider application to introduce a new mode for the hydraulic inversion analysis.
2. Theory
2.1. The Inversion Theory of Mechanical Parameters
The elastic modulus is calculated inversely by the relation between the monitoring data of dam deformation and those of the environment. According to the monitoring theory [32], the displacement along the river of the dam body, disp, consists of the water pressure component
The water pressure component
The inversion analysis is to seek the suitable mechanical parameters to minimize the error
2.2. DNN Surrogate Model
A three-layer network structure with a suitable activation function could approximate any function infinitely in theory with reasonable number of iterative epochs [33]. According to equation (1), the factors affecting the displacement of nodes
[figure omitted; refer to PDF]
The main idea of the
The original reinforcement learning usually adopts linear transformation or look-up table method, which could not solve multidimension or nonlinear problems. DQN algorithm that combines deep learning algorithm and reinforcement learning not only obtains excellent characterization capability of deep learning to transform the data features into the state as the input of the Agent but also selects the proper action a by calculating all feasible state-action values
(1) Storage: store the past data [st, at, rt, st+1] in the memory zone as samples
(2) Sample and replay: extract multiple samples [st, at, rt, st+1] from each batch as the input data of the deep network
During the iterative process of Q-learning, the parameters of the state-action value function operating in the t time step are the same as those operating in the t + 1 time step, which results in synchronous ups and downs of the q value in two time steps, enhancing the probability of model divergence. So, the actor-critic framework was introduced. The actor is expressed as
2.4. Combination of the Improved DQN and Inversion Calculation
The target of mainstream RL is to develop the best policy to guide Agent to select proper a when facing different states from Env and obtain the highest accumulated reward, while the task of inversion calculation is to select an elastic modulus that is suitable for the deformation of the engineering structure and foundation. So, the interactive mode of information between Agent and Env is improved: after Agent selects a proper action a according to state s, Env assesses this a, and in the meantime, this a improves the parameters of the state to search the best material mechanical parameters.
2.4.1. Construction of the Inversion Agent
The DNN surrogate model, established according to Section 2.2, is used to calculate the agent displacement
[figure omitted; refer to PDF]
In summary, this paper adopts the improved DQN algorithm embedded with the DNN surrogate model. Agent completes the task to adjust E in the state from Env to minimize the absolute error (maximize the reward) calculated by agent
2.4.3. Relation of Inversion in Multizones
In different zones in the dam section, relevance among the displacements of nodes, to a certain degree, exists without causality. So, it is unsuitable to adopt equation (16) to adjust parameters in all zones by identical adjustment extent, and it is also unreasonable to adjust the parameter only in one zone corresponding to the current sample, ignoring the relevance among deformation of all zones. With the action of upstream water pressure, the whole section of the dam body demands for the deformation coordination. For example, in Figure 6, the displacement of node PA in the upper zone is related to not only the mechanical parameters in zone Ω1 but also those in zone Ω2. The relevance is expressed with the following equation:
[figure omitted; refer to PDF]
When a sample adjusts the mechanical parameters in other zones, the adjustment factor is
3. Case Study
3.1. Inversion Calculation of the Single Dam Zone: Case A
This case A is to minimize the cumulative absolute error of the agent displacement
Step 1: establish the finite element model. The finite element model is shown in Figure 7, containing two components, dam and foundation. The horizontal direction x is along the river, and the vertical direction y is the elevation. The dam height is 107.5 m, the length of dam bottom is 88 m, and the length and width of the dam foundation are 488 m and 300 m, respectively. All mechanical parameters of the model are listed in Table 1. EA indicates the elastic modulus of the dam section. The nodes of foundation bottom are fixed in the horizontal and vertical direction, and the nodes at both sides of the foundation are fixed in the vertical direction.
[figure omitted; refer to PDF]
Table 1
The mechanical parameters of case A.
| Component | Density (kg/m3) | Elastic modulus (GPa) | Poisson’s ratio |
| Dam body | 2400 | EA | 0.167 |
| Foundation | 2400 | 9 | 0.167 |
Step 2: select the sample. 259 different water levels were extracted randomly from 86.5 m to 104.3 m, and 200 different elastic moduli EA were randomly extracted from 5 GPa to 20 GPa evenly, not containing 10.3 GPa (the target parameter). There were 51,800 groups of combination states of mechanical parameter and water pressure. The model was calculated using software GeHoMadrid developed by Hohai University and Universidad Politécnica de Madrid to get the node displacement of all states. The result
Step 3: construct the DNN surrogate model. The model had four layers, established by Keras. The first layer had four input nodes, named input-EHXY, and the form of the input vector is [EA, H, x, y]. The second and third layers were fully connected layers, named Sec-layer and Third-layer, with 8 and 10 nodes, respectively. The fourth layer was the output layer named dispout with 1 node, and the calculating target was
[figure omitted; refer to PDF]
The samples from step 2 were shuffled randomly, and all data were normalized to [0, 1] according to the data features. Training samples occupied 80%, and the rest were verifying samples. The changing horizontal section and deformable foundation have an effect on the displacement in the dam. And the increase of altitude weakens the nonlinear effect. The location of node C is in the lower zone and near the foundation, so this zone could illustrate nonlinear deformation more clearly than those nodes in the higher area. Besides, the closer the node is to the foundation, the smaller the displacement is, so node C was selected. The predicting samples were the displacement along the river of node C in Figure 7 calculated under the state that the elastic modulus was 10.3 GPa with 259 water levels above.
The iterative process of the training error and verifying error is shown in Figure 9, where it indicated that, during the former 100 epochs, the two errors decreased sharply to the level close to 0. After 200 epochs, the network parameters were nearly stable. When the training stage was completed, the fixed DNN model was stored to replace the finite model in the later steps.
[figure omitted; refer to PDF]
The displacement of different nodes is related to water level elevation and the elastic modulus of the dam body. According to the monitoring theory [32],
Training samples and predicting samples are the same as those of the DNN model. The calculating results of predicting samples are shown in Table 2.
Table 2
Error of predicting samples in 10.3 GPa.
| Relative error (%) | DNN | MLR |
| Mean | 0.372 | 2.723 |
| Maximum | 1.833 | 16.212 |
From Table 2, the mean relative errors of DNN and MLR were lower than 3%. Furthermore, the accuracy of the DNN model in both mean relative error and maximum relative error was an order of magnitude higher than that of the MLP model. The possible reason was that the MLP model constructed regression factors based on the plane cross-section assumption and complete elastomer assumption, but node C was near the dam foundation, which meant during the calculation, the deformation of the dam body and foundation did not meet the first assumption. The displacement of node C was not completely linear. DNN model was nonlinear, which represented that the neural network could map the relation between environmental load and displacement more efficiently. The maximum relative error was lower than 2%. From this, it was reasonable for the DNN, after being well trained, to replace the finite element model.
Step 4: construct Agent. The Agent included three parts. The first one was the DNN model stored in step 3, which received the state s, [EA, H, x, y] and produced agent displacement
[figure omitted; refer to PDF]
Step 5: calculation with the DQN algorithm. The predicting samples normalized in step 3 were target samples in this step. This maximum number of epoch was 100, and each epoch had 100 time steps. The initial value of probability
3.1.1. Process Analysis
Figures 11 and 12 show that, in the initial period, the model was in the exploration stage, selecting actions randomly, resulting in the fluctuation of the reward. Then, the DQN model moved into the exploitation stage, with the increase of epoch and selecting the right action when facing different states. The absolute value of the reward decreased smoothly, and the searching parameters kept approaching the target value in the former 40 epochs before the model was generally stable. The result of inversion calculation reached the optimal status of the model.
[figure omitted; refer to PDF]
When the interactive process between Agent and Env was completed, the eventual elastic modulus EA was 10.3187 GPa, and the actual target was 10.3 GPa. So, the absolute error was 0.0187 GPa, and the relative error was 0.18%. Two possible reasons of the error were as follows: the first one was that the DNN surrogate model had a mean error of 0.372% relative to the finite element model, and its accuracy could determine the accuracy of DQN; the second reason was that the search method of DQN was not perfect. The error level indicated that the inversion consequence calculated by DQN algorithm was very close to the actual value in case A, which meant the method of this paper had a fine effect on the inversion analysis of the whole dam section.
3.2. Inversion Calculation of Double Dam Zones: Case B
This case B is to minimize the cumulative absolute error of the agent displacement
Step 1: establish the finite element model. The finite element model is shown in Figure 15, containing three components: two zones in the dam section and foundation. The horizontal direction x is along the river, and the vertical direction y is the elevation. The dam height is 50 m, the width of the dam crest is 5 m, the length of dam bottom is 5 m, and the length and width of the dam foundation are 190 m and 100 m, respectively. All mechanical parameters of the model are listed in Table 3. EB1 indicates the elastic modulus of the upper zone, and EB2 indicates the elastic modulus of the lower zone. The nodes of foundation bottom are fixed in the horizontal and vertical direction, and the nodes at both sides of the foundation are fixed in the vertical direction.
[figure omitted; refer to PDF]
The samples from step 2 were shuffled randomly, and all data were normalized to [0, 1] according to the data features, where the first independent variable EB1 and the second one EB2 were normalized with the same scale. Training samples occupied 80%, and the rest were verifying samples. The predicting samples were the displacements along the river of nodes A and B in Figure 15 calculated under the state that the upper elastic modulus was 18.0 GPa and the lower one was 22.0 GPa with 140 water levels above.
The iterative process of the training error and verifying error is shown in Figure 17, where it indicated that, during the former 100 epochs, the two errors decreased sharply to the level close to 0. After the former 200 epochs, the network parameters were nearly stable. After the training stage, the DNN model was stored to replace the finite model in the later steps.
[figure omitted; refer to PDF]
The maximum relative error was 3.56%, and the mean relative error was 0.59%, which indicated that the overall relative error was low. It was reasonable for DNN, after being well trained, to replace the finite element model according to the accuracy.
Step 4: construct Agent. The structure and parameters in Agent were the same as those in case A except that the input layer of the fixed DNN model had 5 nodes.
Step 5: calculation with DQN algorithm. The predicting samples normalized in step 3 were calculated as target samples in this step. This maximum number of epoch was 200, and each epoch had 100 time steps. The variation of random probability
3.2.1. Process Analysis
Different from Figure 11, Figure 18 shows that the zoning reward had been increasing with constant fluctuation during the negative reinforcement stage and then was stable in (−0.2∼0), which indicated that the change of one zone would lead to the fluctuation of another zone. As a result, the agent displacement could not remain steady completely, but the overall trend was increasing, representing that the absolute value of the reward was decreasing, which meant that the penalty from Env was lower and lower and got stable in a certain range. Figure 19 shows the searching parameters kept approaching the target parameters and then tended to be stable. The result of inversion calculation reached the optimal status of the model.
[figure omitted; refer to PDF]
The calculating result of DQN algorithm is listed in Table 4, which showed the relative error in the upper zone was 1.29%, and the other one in the lower zone was slightly smaller, 0.86%. The error level indicated that the inversion consequence calculated by DQN algorithm was very close to the actual parameter values in case B, meaning the method of this paper had a fine effect on the inversion analysis of the dam with multiple zones.
Table 4
The result of inversion calculation in two zones.
| Zone | Upper | Lower |
| Target (GPa) | 18.0 | 22.0 |
| Result (GPa) | 18.2325 | 22.1893 |
| Absolute error (GPa) | 0.2325 | 0.1893 |
| Relative error (%) | 1.29 | 0.86 |
3.3. Verification with Actual Engineering: Case C
The engineering is a RCC dam on the main stream of a river in Cambodia, with 10 dam sections. The elevation of the dam crest is at 153.00 m, and the bottom surface is at 41.00 m, with a maximum dam height of 112.00 m. The width of the dam crest is 6.00 m. The top elevation of the upstream break slope is 84.0 m, and the slope is 1 : 0.3, and the downstream slope is 1 : 0.75. The mechanical parameters of the rock in the dam foundation are shown in Table 5. Under the long-term action of dam gravity and groundwater, the displacement along the river of the project showed a slow upward trend during the operating period, so the material parameters of the dam foundation should be paid attention to. The target of case C is the elastic modulus of the foundation of the project.
Table 5
The mechanical parameters of the rock in the dam foundation.
| Rock type | Young’s modulus (GPa) | Poisson’s ratio | Shearing strength | Compressive strength (MPa) | Bulk density (kg/m3) |
| Quartz sandstone | 5∼10 | 0.15∼0.23 | C = 4.7∼8.4 MPa | 60∼80 | 2520 |
| Fine sandstone | 7∼8 | 0.18∼0.25 | C = 3∼5 MPa | 45∼55 | 2510 |
| Silty mudstone | 2∼3 | 0.28∼0.30 | C = 0.8∼1.0 MPa | 10∼20 | 2530 |
| Mudstone | 1∼2 | 0.30∼0.35 | C = 0.6∼0.8 MPa | 1∼3 | 2350 |
Step 1: establish the finite element model. This case selected one section of the dam, where the foundation was at 45.5 m, and the dam height was 107.5 m. The length of the dam foundation was 88.0 m, and the size of the dam foundation was 488 m
Table 6
The mechanical parameters of the actual project.
| Component | Density (kg/m3) | Elastic modulus (GPa) | Poisson’s ratio |
| Dam body | 2400 | 25 | 0.167 |
| Foundation | 2520 | EC | 0.200 |
Step 2: select the sample. 221 water-level data, from 125.33 m to 145.96 m, were selected on the dates when the inverted plumb line measured displacement. Because of the unknown actual parameter in the dam foundation, in order to make the training samples contain the possible target, 200 different elastic moduli EC were selected from 3 GPa to 10 GPa according to the values in Table 5. There were 44,200 groups of combination states of the mechanical parameter and water pressure. The model was calculated using software GeHoMadrid to get the node displacement of all states. The result
Step 3: withdraw the water pressure component. The multivariable linear regression model is shown in the following equation:
[figure omitted; refer to PDF]
Step 4: construct the DNN surrogate model. The structure and parameters were the same as those of the DNN model in case A. The samples from step 2 were shuffled randomly, and all data were normalized to [0, 1] according to the data features. After that, training samples occupied 70%, 15% of samples were used to verify the DNN model, and the rest were predicting samples.
The iterative process of the training error and verifying error is shown in Figure 23, where it indicated that, during the former 100 epochs, the two errors decreased sharply to the level close to 0. After the 200 epochs, the network parameters were nearly stable. After the training stage, the DNN model was stored to replace the finite element model in the later steps.
[figure omitted; refer to PDF]
Step 5: construct Agent. The structure and parameters in Agent were the same as those in case A.
Step 6: the calculating target was searching the elastic modulus of the dam foundation to minimize the difference between the inversion result and the actual water pressure component. This maximum number of epoch was 200, and each epoch had 100 time steps. The variation of random probability
3.3.1. Process Analysis
Figures 24 and 25 show that, in the initial period, the model was in the exploration stage, selecting actions randomly, resulting in the fluctuation of the reward. After that, the DQN model moved into the exploitation stage. With the increase of epoch and selecting the right action when facing different states, the absolute value of the reward was decreasing consistently, and the searching parameters kept approaching the target from the initial value 10 GPa in the former 50 epochs before the model was generally stable.
[figure omitted; refer to PDF][figure omitted; refer to PDF]3.3.2. Result Analysis
After the interactive process between Agent and Env, the elastic modulus EC of the dam foundation was 5.1549 GPa. All calculating results are shown in Figures 22 and 26. The former displayed that the blue line indicating the inversion displacement series fitted well with the orange line representing the water pressure component, except a few points with obvious errors, which meant that the displacement values of two lines were close at the same water level on the whole. The latter was the distribution of the absolute error calculated by two displacement series, whose mean value was 0.0712 mm and standard deviation was 0.0985 mm. These errors were mainly concentrated on 0 mm∼0.1 mm. A few values reached 0.3 mm∼0.4 mm. The error level was low in the mass, which indicated that this method in the paper was suitable to be applied in actual engineering.
[figure omitted; refer to PDF]4. Conclusion
The accurate calculation of mechanical parameters in the engineering structure and foundation is dependent on detailed monitoring data of the structure and environment, reasonable constitutive model, and excellent searching algorithm. In this paper, the DNN model with a suitable structure replaced the finite element model and was embedded in the agent of the reinforcement algorithm to form the DQN, which was used to optimize the mechanical parameters in engineering in the global space. The conclusions are as follows:
(1) According to the mechanical parameters and environmental loads of engineering, the corresponding DNN surrogate model was established to replace the finite element model. After the network model was verified, the mean relative error of predicting samples calculated by the DNN model with suitable hyperparameters and a regular training stage was lower than 1%, and the calculating efficiency of the DNN was much higher than that of the constitutive model, which indicated that it was advantageous for a reasonable DNN model to map the relation between the target displacement and the state of different mechanical parameters combining with variable environmental loads.
(2) The DNQ algorithm improving the interactive mode between Env and Agent combined with the DNN surrogate model completed the inversion calculation of the structural mechanical parameter. After the improved framework calculated target values in examples, the maximum relative error and the minimum one of the elastic moduli after searching process were 1.29% and 0.18%, respectively. After the improved algorithm was used in actual engineering, the inversion displacement series fitted well with the water pressure component on the whole. Thus, the DQN algorithm had a good effect in the inversion analysis of mechanical parameters in the hydraulic structure.
(3) The method to express the displacement relation among different dam zones was introduced to ensure the relevance and coordination during the process of optimizing parameters from multizones. This improvement extended the FEM from a single region in case A to a double region in case B, providing a new path for inversion analysis in multiple structural zones.
(4) The research focus is to combine the DNN surrogate model and the improved DQN algorithm and then apply the new model to the inversion calculation of mechanical parameters in the hydraulic structure and foundation with single or multiple zones. In future, the framework could be developed to improve the optimization method applied to inversion analysis in multiple monitoring points and several kinds of mechanical parameters.
Authors’ Contributions
Wei Ji contributed to conceptualization, data curation, formal analysis, methodology, software, visualization, writing, reviewing, and editing. Xiaoqing Liu contributed to funding acquisition, investigation, project administration, supervision, writing, review, and editing. Huijun Qi contributed to conceptualization, methodology, software, visualization, writing, review, and editing. Chaoning Lin contributed to investigation and formal analysis. Xunnan Liu contributed to data curation and software. Tongchun Li contributed to resources and project administration.
Acknowledgments
This research was greatly supported by the National Key Research and Development Plan (no. 2018YFC0407102), the Fundamental Research Funds for the Central Universities (SN: B200202180), and the National Natural Science Foundation of China (SN: 52009035).
[1] W. Ge, Z. Li, R. Y. Liang, W. Li, Y. Cai, "Methodology for establishing risk criteria for dams in developing countries, case study of China," Water Resources Management, vol. 31 no. 13, pp. 4063-4074, DOI: 10.1007/s11269-017-1728-0, 2017.
[2] A. J. Wiley, "The St. Francis dam failure," Journal-American Water Works Association, vol. 20 no. 3, pp. 338-342, DOI: 10.1002/j.1551-8833.1928.tb13638.x, 1928.
[3] P. Habib, "The Malpasset dam failure," Engineering Geology, vol. 24, pp. 295-329, DOI: 10.1016/0013-7952(87)90070-6, 1987.
[4] R. Ardito, G. Maier, G. Massalongo, "Diagnostic analysis of concrete dams based on seasonal hydrostatic loading," Engineering Structures, vol. 30 no. 11, pp. 3176-3185, DOI: 10.1016/j.engstruct.2008.04.008, 2008.
[5] R. Ardito, P. Bartalotta, L. Ceriani, G. Maier, "Diagnostic inverse analysis of concrete dams with statical excitation," Journal of the Mechanical Behavior of Materials, vol. 15 no. 6, pp. 381-390, DOI: 10.1515/jmbm.2004.15.6.381, 2004.
[6] C. Lin, T. Li, X. Liu, "A deformation separation method for gravity dam body and foundation based on the observed displacements," Structural Control and Health Monitoring, vol. 26 no. 2,DOI: 10.1002/stc.2304, 2019.
[7] S. Chen, C. Gu, C. Lin, "Safety monitoring model of a super-high concrete dam by using RBF neural network coupled with kernel principal component analysis," Mathematical Problems in Engineering, vol. 2018,DOI: 10.1155/2018/1712653, 2018.
[8] C. Lin, T. Li, S. Chen, X. Liu, C. Lin, S. Liang, "Gaussian process regression-based forecasting model of dam deformation," Neural Computing and Applications, vol. 31 no. 12, pp. 8503-8518, DOI: 10.1007/s00521-019-04375-7, 2019.
[9] S. Chen, C. Gu, C. Lin, "Multi-kernel optimized relevance vector machine for probabilistic prediction of concrete dam displacement," Engineering with Computers,DOI: 10.1007/s00366-019-00924-9, 2020.
[10] C. Lin, T. Li, S. Chen, "Structural identification in long-term deformation characteristic of dam foundation using meta-heuristic optimization techniques," Advances in Engineering Software, vol. 148,DOI: 10.1016/j.advengsoft.2020.102870, 2020.
[11] R. Perera, S.-E. Fang, A. Ruiz, "Application of particle swarm optimization and genetic algorithms to multiobjective damage identification inverse problems with modelling errors," Meccanica, vol. 45 no. 5, pp. 723-734, DOI: 10.1007/s11012-009-9264-5, 2010.
[12] F. Kang, J. Li, H. Li, "Artificial bee colony algorithm and pattern search hybridized for global optimization," Applied Soft Computing, vol. 13 no. 4, pp. 1781-1791, DOI: 10.1016/j.asoc.2012.12.025, 2013.
[13] F. Kang, J.-s. Li, J.-j. Li, "System reliability analysis of slopes using least squares support vector machines with particle swarm optimization," Neurocomputing, vol. 209, pp. 46-56, DOI: 10.1016/j.neucom.2015.11.122, 2016.
[14] F. Kang, Q. Xu, J. Li, "Slope reliability analysis using surrogate models via new support vector machines with swarm intelligence," Applied Mathematical Modelling, vol. 40 no. 11-12, pp. 6105-6120, DOI: 10.1016/j.apm.2016.01.050, 2016.
[15] F. Kang, J. Li, "Artificial bee colony algorithm optimized support vector regression for system reliability analysis of slopes," Journal of Computing in Civil Engineering, vol. 30 no. 3,DOI: 10.1061/(asce)cp.1943-5487.0000514, 2016.
[16] S. Dou, J. Li, F. Kang, "Parameter identification of concrete dams using swarm intelligence algorithm," Engineering Computations, vol. 34 no. 7,DOI: 10.1108/EC-03-2017-0110, 2017.
[17] R. Nian, J. Liu, B. Huang, "A review on reinforcement learning: Introduction and applications in Industrial process control," Computers & Chemical Engineering, vol. 139,DOI: 10.1016/j.compchemeng.2020.106886, 2020.
[18] R. Sutton, A. Barto, Reinforcement Learning: An Introduction, 1998.
[19] R. Sutton, A. Barto, Reinforcement Learning: An Introduction, 2018.
[20] R. Bellman, "A Markovian decision process," Indiana University Mathematics Journal, vol. 6 no. 4, pp. 679-684, DOI: 10.1512/iumj.1957.6.56038, 1957.
[21] M. R. K. Mes, A. P. Rivera, "Approximate dynamic programming by practical examples," Markov Decision Processes in Practice, 2017. ISBN 978-3-319-47764-0
[22] D. Silver, G. Lever, N. Heess, T. Degris, D. Wierstra, M. Riedmiller, "Deterministic policy gradient algorithms," Proceedings of the 31 st International Conference on Machine Learning, .
[23] V. Mnih, "Playing atari with deep reinforcement learning," 2013. http://arxiv.org/abs//1312.5602
[24] V. Mnih, K. Kavukcuoglu, D. Silver, "Human-level control through deep reinforcement learning," Nature, vol. 518, pp. 529-533, DOI: 10.1038/nature14236, 2015.
[25] Z. Zhang, A. Chong, Y. Pan, C. Zhang, K. P. Lam, "Whole building energy model for HVAC optimal control: a practical framework based on deep reinforcement learning," Energy and Buildings, vol. 199, pp. 472-490, DOI: 10.1016/j.enbuild.2019.07.029, 2019.
[26] Z. Wang, T. Hong, "Reinforcement learning for building controls: the opportunities and challenges," Applied Energy, vol. 269,DOI: 10.1016/j.apenergy.2020.115036, 2020.
[27] Z. Bing, C. Lemke, L. Cheng, "Energy-efficient and damage-recovery slithering gait design for a snake-like robot based on reinforcement learning and inverse reinforcement learning," Neural Networks, vol. 129, pp. 323-333, DOI: 10.1016/j.neunet.2020.05.029, 2020.
[28] I. Carlucho, M. De Paula, G. Acosta, "An adaptive deep reinforcement learning approach for MIMO PID control of mobile robots," ISA Transactions, vol. 102,DOI: 10.1016/j.isatra.2020.02.017, 2020.
[29] J. García, D. Shafie, "Teaching a humanoid robot to walk faster through safe reinforcement learning," Engineering Applications of Artificial Intelligence, vol. 88,DOI: 10.1016/j.engappai.2019.103360, 2020.
[30] F. Li, Q. Jiang, S. Zhang, M. Wei, R. Song, "Robot skill acquisition in assembly process using deep reinforcement learning," Neurocomputing, vol. 345, pp. 92-102, DOI: 10.1016/j.neucom.2019.01.087, 2019.
[31] F. Chang, T. Chen, W. Su, Q. Alsafasfeh, "Control of battery charging based on reinforcement learning and long short-term memory networks," Computers & Electrical Engineering, vol. 85,DOI: 10.1016/j.compeleceng.2020.106670, 2020.
[32] Z. Wu, Safety Monitoring Theory & its Application of Hydraulic Structures, 2003.
[33] K. Hornik, M. Stinchcombe, H. White, "Multilayer feedforward networks are universal approximators," Neural Networks, vol. 2 no. 5, pp. 359-366, DOI: 10.1016/0893-6080(89)90020-8, 1989.
[34] C. J. C. H. Watkins, P. Dayan, "Q-learning," Machine Learning, vol. 8 no. 3-4, pp. 279-292, DOI: 10.1007/bf00992698, 1992.
[35] J. Yang, J. Dai, C. Yao, S. Jiang, C. Zhou, Q. Jiang, "Estimation of rock mass properties in excavation damage zones of rock slopes based on the Hoek-Brown criterion and acoustic testing," International Journal of Rock Mechanics and Mining Sciences, vol. 126,DOI: 10.1016/j.ijrmms.2019.104192, 2020.
[36] Z. Z. Wang, Y. J. Jiang, C. A. Zhu, "Seismic energy response and damage evolution of tunnel lining structures," European Journal of Environmental and Civil Engineering, vol. 23 no. 6, pp. 758-770, DOI: 10.1080/19648189.2017.1304283, 2019.
Copyright © 2020 Wei Ji et al. This is an open access article distributed under the Creative Commons Attribution License (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. https://creativecommons.org/licenses/by/4.0/