Content area
Fault diagnosis for gearbox by robust variational mode decomposition (RVMD) and twin extreme learning machine (TELM) with composite chaotic grey wolf optimizer (CCGWO) is proposed in this study. Robust variational mode decomposition is an advanced signal processing technique designed to decompose complex signals into intrinsic mode functions (IMFs) while maintaining robustness against noise and outliers,which addresses the limitations of variational mode decomposition (VMD), particularly its sensitivity to noise and its tendency to produce suboptimal results in the presence of outliers. The proposed twin extreme learning machine with composite chaotic grey wolf optimizer (CCGTELM) model can extract higher-level features and has higher classification accuracy than traditional ELM. A novel grey wolf optimization algorithm, named composite chaotic grey wolf optimizer (CCGWO), is used to optimize the kernel parameter of TELM. Thus, TELM with CCGWO (DGTELM) is used to fault diagnosis for gearbox.The experimental results demonstrates that fault diagnosis accuracy of RVMD–CCGTELM is higher than VMD-TELM, VMD–DNN, VMD–CNN, VMD–LSTM, EMD–ELM and WT–ANN, and RVMD–CCGTELM is suitable for fault diagnosis of gearbox.
Introduction
The gearbox is one of the most critical components of a station wagon. Any defects in the gearbox can cause the station wagon to malfunction, making the study of fault diagnosis methods for gearboxe highly important. In recent years, numerous fault diagnosis techniques have been developed1,2, including artificial neural networks (ANNs)3, 4, 5, 6–7 and extreme learning machines (ELMs)8,9. For instance, Wu et al.10 introduced a method combining wavelet transform and artificial neural networks (WT–ANN) for identifying faults in rotating machinery. To enhance the accuracy of fault diagnosis using artificial neural networks, deep neural networks have been proposed. For example, Li et al.11 utilized deep neural networks for diagnosing faults in planetary gears, while Li et al.12 proposed a method integrating variational mode decomposition and deep neural networks (VMD–DNN) for the same purpose. These studies demonstrate that deep learning methods can significantly improve model performance. Additionally, the network structure of extreme learning machines is simple, and unlike other methods, the connection weights and thresholds between layers do not require adjustment, which significantly enhances the learning and training efficiency of ELMs13, 14–15. As a result, ELMs have been widely applied in fault diagnosis. For example, Silva et al.16 developed an intelligent embedded system for decision support in pulsed eddy current corrosion diagnosis using ELMs. Similarly, Silva et al.17 applied ELMs to study time-of-flight diffraction ultrasound for flaw diagnosis in welded steel plates. The proposed twin extreme learning machine (TELM) generates a pair of non-parallel hyperplanes in a random feature space, utilizing only the least squared loss to minimize empirical error in its optimization problem. This approach enables the extraction of higher-level features and achieves higher classification and recognition accuracy compared to traditional ELM.
Empirical mode decomposition is prone to mode mixing, where signals with different feature time scales may be mixed in the same intrinsic mode function (IMF) component, and endpoint effects are significant during the decomposition process. The decomposition results near the boundaries are unstable, affecting the accuracy of the results. Wavelet transform may generate false fluctuations at signal abrupt changes, resulting in distortion of the constructed time–frequency images. Robust Variational Mode Decomposition (RVMD) is an advanced signal processing technique designed to decompose complex signals into intrinsic mode functions (IMFs) while maintaining robustness against noise and outliers. It is an enhancement of the traditional Variational Mode Decomposition (VMD) algorithm, which is widely used for signal decomposition in various fields such as biomedical engineering, mechanical fault diagnosis, and financial data analysis. RVMD addresses the limitations of VMD, particularly its sensitivity to noise and its tendency to produce suboptimal results in the presence of outliers. By incorporating robust statistical methods and adaptive mechanisms, RVMD ensures more reliable and accurate signal decomposition. RVMD is designed to overcome the limitations of VMD by incorporating robust statistical methods and adaptive mechanisms. RVMD replaces the traditional squared loss function with robust loss functions, such as the Huber loss or the Tukey biweight loss, which are less sensitive to outliers and noise. RVMD introduces adaptive mechanisms for tuning the penalty parameter and the number of IMFs, ensuring optimal performance across different types of signals. RVMD employs nonlinear convergence factors to balance global and local search capabilities, preventing premature convergence and improving the accuracy of the decomposition.
Therefore,fault diagnosis for gearbox by robust variational mode decomposition and twin extreme learning machine with differential evolution-based grey wolf optimization (CCGWO) algorithm is proposed in this paper. As the features of the time–frequency image based on RVMD are clearer than those of VMD, RVMD is used to decompose the vibration signals of gearbox, and create the time–frequency images of the vibration signals of gearbox. Twin extreme learning machine with CCGWO (DGTELM) is used to fault diagnosis for gearbox. To address these issues, this paper proposes a composite chaotic grey wolf optimization algorithm, referred to as CCGWO. In the population initialization phase, the algorithm employs a composite chaotic sequence, Logistic-Cos, which combines the Logistic sequence and the Cos sequence, to initialize the population. This approach enriches population diversity and enhances the ability to find the optimal solution. Composite chaotic grey wolf optimization algorithm is used to optimize the kernel parameter of TELM. The experimental results demonstrates that RVMD–CCGTELM is suitable for fault diagnosis of gearbox of station wagon.
Robust variational mode decomposition
Robust Variational Mode Decomposition (RVMD) is an advanced signal processing technique designed to decompose complex signals into intrinsic mode functions (IMFs) while maintaining robustness against noise and outliers. It is an enhancement of the traditional Variational Mode Decomposition (VMD) algorithm, which is widely used for signal decomposition in various fields such as biomedical engineering, mechanical fault diagnosis, and financial data analysis. RVMD addresses the limitations of VMD, particularly its sensitivity to noise and its tendency to produce suboptimal results in the presence of outliers. By incorporating robust statistical methods and adaptive mechanisms, RVMD ensures more reliable and accurate signal decomposition.
Before delving into RVMD, it is essential to understand the foundational principles of VMD. VMD is a fully adaptive signal decomposition method that decomposes a signal into a finite number of band-limited IMFs. Each IMF is characterized by a specific center frequency and bandwidth. The key idea behind VMD is to formulate the decomposition process as a variational optimization problem18,19, where the goal is to minimize the sum of the bandwidths of all IMFs while ensuring that their sum reconstructs the original signal.
The VMD algorithm solves the following constrained optimization problem:
1
where denotes the Dirac distribution, denotes the convolution operator.The VMD algorithm iteratively updates the IMFs and their center frequencies until convergence is achieved. While VMD is effective in many applications, it has limitations, particularly in noisy environments, where its performance can degrade significantly.The traditional VMD algorithm has several limitations that RVMD aims to address: (1) VMD is highly sensitive to noise, which can distort the extracted IMFs and lead to inaccurate signal decomposition. (2)The presence of outliers in the signal can adversely affect the performance of VMD, causing it to produce unreliable results. (3) VMD relies on predefined parameters, such as the number of IMFs and the penalty parameter, which may not be optimal for all types of signals. (4)VMD may converge to local optima, especially in complex or high-dimensional signals, leading to suboptimal decomposition.
RVMD is designed to overcome the limitations of VMD by incorporating robust statistical methods and adaptive mechanisms. RVMD replaces the traditional squared loss function with robust loss functions, such as the Huber loss or the Tukey biweight loss, which are less sensitive to outliers and noise. RVMD introduces adaptive mechanisms for tuning the penalty parameter and the number of IMFs, ensuring optimal performance across different types of signals. RVMD employs nonlinear convergence factors to balance global and local search capabilities, preventing premature convergence and improving the accuracy of the decomposition. RVMD incorporates regularization techniques to enhance the stability and robustness of the decomposition process.
The RVMD algorithm extends the traditional VMD formulation by introducing robust loss functions and regularization term. The optimization problem in RVMD is formulated as:
2
where denotes the robust loss function,and denotes a regularization term that enhances the stability of the decomposition.The robust loss function is chosen to be less sensitive to outliers and noise. For example, the Huber loss function is defined as:
3
where is a threshold parameter that determines the transition point between the quadratic and linear regions of the loss function.In the RVMD algorithm, IMFs are updated as follows:
4
Update the center frequencies using the updated IMFs:
5
where is the Fourier transform of .RVMD offers several advantages over traditional VMD: (1) By using robust loss functions, RVMD is less sensitive to noise and outliers, ensuring more accurate signal decomposition. (2) RVMD adaptively tunes its parameters, making it suitable for a wide range of signals and applications. (3) The use of nonlinear convergence factors and regularization techniques prevents premature convergence and enhances the accuracy of the decomposition. (4) RVMD can be applied to various types of signals, including biomedical signals, mechanical vibrations, and financial time series.
Figure 1 compares the time–frequency images generated by VMD and RVMD for the no-damage condition, Fig. 2 provides the comparison for the abrasion condition, Fig. 3 shows the comparison for the tooth pitting condition, and Fig. 4 illustrates the comparison for the snaggletooth condition. It is evident that the time–frequency images produced by RVMD exhibit clearer features compared with those generated by VMD. As a result, RVMD is employed in this paper to create the time–frequency images of the gearbox vibration signals.
[See PDF for image]
Fig. 1
Comparison of time–frequency image between VMD and RVMD in the situation of no damage.
[See PDF for image]
Fig. 2
Comparison of time–frequency image between VMD and RVMD in the situation of abrasion.
[See PDF for image]
Fig. 3
Comparison of time–frequency image between VMD and RVMD in the situation of tooth pitting.
[See PDF for image]
Fig. 4
Comparison of time–frequency image between VMD and RVMD in the situation of snaggletooth.
The CCGTELM model
The classification model of ELM,, where is the mapping function matrix, is the weight vector connecting the output layer and the hidden layer, , is the Hidden layer output matrix, is the penalty factor, and I is the identity matrix.The output of extreme learning machine is expressed as .
The core of TELM algorithm is to solve the following two quadratic programming:
6
where denotes the output, denotes the output of hidden layer,, , denotes the weight, denotes the bias, denotes the activation function, denote error vector, denote the penalty parameters.Introducing regularization term , can be obtained as follows:
7
where denotes the Lagrange multipliers.Finally, TELM can be obtained as follows:
8
As the penalty parameters, of kernel function needs to be determined, a differential evolution strategy-based GWO is employed to determine the penalty parameters. Traditional GWO algorithm limits the multi class of the grey wolf population,which will make the algorithm easily converge and fall into local optima.
Although the Grey Wolf Optimization (GWO) algorithm is characterized by its simplicity and efficiency, its performance is highly sensitive to the initial population settings when dealing with complex problems such as high-dimensional and multi-peak optimization. The linear variation of the convergence factor in the search mechanism may lead to premature convergence, making the algorithm prone to falling into local optima and suffering from low search accuracy.
To address these issues, this paper proposes a composite chaotic grey wolf optimization algorithm, referred to as CCGWO. In the population initialization phase, the algorithm employs a composite chaotic sequence, Logistic-Cos, which combines the Logistic sequence and the Cos sequence, to initialize the population. This approach enriches population diversity and enhances the ability to find the optimal solution. During the search phase, a nonlinear convergence factor is introduced to replace the traditional linear convergence factor, balancing global and local search capabilities. This improves the algorithm’s search efficiency, enhances its comprehensiveness and accuracy, and avoids getting trapped in local optima. Furthermore, Cauchy mutation is introduced, leveraging the perturbation capability of the Cauchy operator to provide a broader search space and increase the probability of finding the global optimal solution.
In the Grey Wolf Optimization (GWO) algorithm, the initial distribution of the population significantly impacts its performance20,21. However, the random strategy used in the initialization phase of the traditional GWO algorithm may result in an initial grey wolf population that is either overly concentrated or unevenly distributed. Both scenarios are detrimental to the algorithm’s ability to find the optimal solution. Chaotic systems are highly sensitive to initial conditions, meaning even minor changes can lead to significant differences in the system’s final state. This characteristic enables chaotic initialization to generate an initial population with high randomness and diversity, ensuring uniform distribution and coverage of the entire solution space. This approach helps prevent premature convergence of the optimization algorithm, avoids getting trapped in local optima, and enhances the algorithm’s global search capability, thereby improving the likelihood of finding the global optimal solution.
Here, the objective function can be given as follows:
9
where is the total number of the validation samples, and is the total number of the validation samples with correct validation.The fitness is defined as follows:
10
The Logistic chaotic mapping is widely used in the initialization process of swarm optimization algorithms due to its simplicity, ease of implementation, and single control parameter, which makes it convenient to adjust and embed. However, the Logistic mapping has a limited number of folding iterations, and the generated random numbers tend to cluster at the extremes, which somewhat restricts its effectiveness in improving the initial population distribution. To address this limitation, the CCGWO algorithm adopts a composite chaotic mapping model, Logistic-Cos, which combines the Logistic mapping and the Cos mapping, for population initialization during the initialization phase.
This processing of improving the GWO algorithm by using composite chaotic mapping model is shown as follows:
11
To avoid poor convergence and falling into local optima, the CCGWO algorithm introduces Cauchy mutation, leveraging the perturbation capability of the Cauchy operator to increase the probability of finding the global optimal solution.
As shown in Fig. 5,the processing of determining the parameters of TELM by using differential evolution strategy-based grey wolf optimization algorithm,which is described as follows:
[See PDF for image]
Fig. 5
The processing flow of determining the parameters of TELM by using differential evolution strategy-based grey wolf optimization algorithm.
Step 1 Initialize the range of parameters and configure the parameters of CCGWO. Set the grey wolf population size to 20 and the maximum number of iterations to 100.
Step 2 Define the fitness of each grey wolf individual according to Eq. (10).
Step 3 Calculate the fitness values of each grey wolf individual, determine their hierarchical roles (α, β, and δ), and record their respective positions. Based on the fitness values, classify the grey wolf population into α, β, and δ wolves. During each iteration of the CCGWO search process, the α, β, and δ wolves guide the remaining grey wolf individuals in the hunting process.
Step 4 Update the position of each individual and select the α, β, and δ wolves based on their fitness values. Use composite chaotic mapping to generate a new chaotic population for the α wolf, which has the highest fitness value in the current population. This population is inversely mapped back to the original solution space to assess its fitness. The optimal solution from this population then replaces the corresponding solution in the original population space.
Step 5 The remaining grey wolves locate their target by following the guidance of the α, β, and δ wolves.
Step 6 The process terminates when the maximum number of iterations is reached. Otherwise, loop to Step 3.
Step 7 The optimal parameters of TELM are obtained.
The comparison of the optimization process among CCGWO, GWO, and PSO is given in Fig. 6. It is obvious that the convergence of CCGWO is better than that of GWO,and PSO.
[See PDF for image]
Fig. 6
Comparison of the optimization process among CCGWO, GWO, and PSO.
Experimental analysis
The gearbox fault diagnosis test setup is illustrated in Fig. 5. Vibration signals from the gearbox were captured using a vibration signal acquisition device. As depicted in Fig. 7, the fault diagnosis system for the gearbox comprises a motor, gearbox, coupling, brake, acceleration sensor, corresponding acquisition card, and a computer. The brake can be used to generate the load. The sampling frequency is 5 kHz, and the vibration signals are collected using an acceleration sensor paired with its acquisition card. The datasets include the vibration data from the following two conditions: 1200 rpm with no load and 1800 rpm with a medium load. The gearbox fault diagnosis model is implemented in MATLAB. Three primary gearbox faults—tooth pitting, snaggletooth, and abrasion—are investigated in this experiment.The vibration signals of the state of no damage and the faults of gearbox including tooth pitting, snaggletooth, abrasion are collected,respectively.
[See PDF for image]
Fig. 7
Fault diagnosis test rig of gearbox.
In this experiment, a total of 800 samples are used for training, consisting of 200 samples each for no damage, tooth pitting, snaggletooth, and abrasion. Additionally, 200 samples are used for testing, with 50 samples each for no damage, tooth pitting, snaggletooth, and abrasion. Firstly,as the features of the time–frequency image based on RVMD are clearer than those of VMD, RVMD is used to decompose the vibration signals of gearbox,and create the time–frequency images of the vibration signals of gearbox.Secondly,analysis the correlation of the texture features, the texture features with weak correlation of the time–frequency images are extracted as the input vector of the diagnosis model,and the corresponding state of gearbox is used as the output of the diagnosis model.Thirdly,set the grey wolf population size to 20 and the maximum number of iterations to 100,the ranges of the penalty parameters are respectively set to [1 10000],and the optimal parameters of TELM are obtained by CCGWO.Finally,the optimal TELM can be obtained and used to fault diagnosis for gearbox.For different gearbox’s faults,the bigger the discriminability of the feature is,the more important the feature is. As shown in Figs. 8 and 9, the discriminability ability for gearbox’s faults by RVMD is better than by VMD.Obviously,the correlation between these features is relatively weak, and there are more features with discriminability ability for gearbox’s faults by RVMD than by VMD. In order to highlight the superiority of RVMD–CCGTELM, VMD–TELM, VMD–DNN, VMD–CNN, VMD–LSTM, EMD–ELM and WT–ANN are respectively compared with RVMD–CCGTELM. The parameters of RVMD–CCGTELM are selected by CCGWO, and the parameters of other models are selected by GWO.
[See PDF for image]
Fig. 8
Comparison of features between tooth pitting and abrasion by RVMD.
[See PDF for image]
Fig. 9
Comparison of features between tooth pitting and abrasion by VMD.
The fault diagnosis results for the gearbox using RVMD–CCGTELM, VMD–TELM, VMD–DNN, VMD–CNN, VMD–LSTM, EMD–ELM and WT–ANN are presented in Fig. 10. In Fig. 10a, the results from RVMD–CCGTELM show that 2 samples are misdiagnosed. Figure 10b displays the results from VMD–TELM, where 13 samples are misdiagnosed. Figure 10c illustrates the results from VMD–DNN, with 16 samples incorrectly diagnosed. Similarly, Fig. 10d shows the results from VMD–DNN, also with 13 misdiagnosed samples, Fig. 10e presents the results from VMD–LSTM, where 14 samples are incorrectly diagnosed, Fig. 10f presents the results from EMD-ELM, where 18 samples are incorrectly diagnosed, and Fig. 10g presents the results from WT–ANN, where 24 samples are incorrectly diagnosed.
[See PDF for image]
Fig. 10
Comparison of the diagnosis results among RVMD–CCGTELM, VMD–TELM, VMD–DNN, VMD–CNN, VMD–LSTM, EMD–ELM and WT–ANN.
[See PDF for image]
As summarized in Table 1, fault diagnosis accuracy of RVMD–CCGTELM is 99%, fault diagnosis accuracy of VMD–TELM is 93.5%, fault diagnosis accuracy of VMD–DNN is 92%, fault diagnosis accuracy of VMD–CNN is 93.5%, fault diagnosis accuracy of VMD–LSTM is 93%,fault diagnosis accuracy of EMD–ELM is 91%,fault diagnosis accuracy of WT–ANN is 88%.
Table 1. Comparison of the diagnosis accuracies among RVMD–CCGTELM, VMD–TELM, VMD–DNN, VMD–CNN, VMD–LSTM, EMD–ELM and WT–ANN.
Method | The number of the samples with correct diagnosis | Accuracy (%) |
|---|---|---|
RVMD–CCGTELM | 198 | 99 |
VMD–TELM | 187 | 93.5 |
VMD–DNN12 | 184 | 92 |
VMD–CNN | 187 | 93.5 |
VMD–LSTM | 186 | 93 |
EMD–ELM | 182 | 91 |
WT–ANN10 | 176 | 88 |
Fault diagnosis accuracy of RVMD–CCGTELM increases by 5.5% compared to VMD–TELM, by 7% compared to VMD–DNN, by 5.5% compared to VMD–CNN, by 6% compared to VMD–LSTM, by 8% compared to EMD–ELM, and by 11% compared to WT–ANN.
These results indicate that RVMD–CCGTELM achieves higher accuracy in fault diagnosis compared with VMD–TELM, VMD–DNN, VMD–CNN, VMD–LSTM, EMD–ELM and WT–ANN. The experimental findings confirm that RVMD–CCGTELM is well-suited for fault diagnosis of gearbox.
To evaluate the anti-interference capability of RVMD–CCGTELM, comparisons were made with VMD–TELM, VMD–ELM, VMD–DNN, and WT–ANN under white noise interference,impulsive noise, respectively.
The fault diagnosis results for the gearbox using RVMD–CCGTELM, VMD–TELM, VMD–DNN, VMD–CNN, VMD–LSTM, EMD–ELM and WT–ANN under white noise interference are illustrated. As summarized in Table 2, fault diagnosis accuracy of RVMD–CCGTELM is 98.5%, fault diagnosis accuracy of VMD–TELM is 93%, fault diagnosis accuracy of VMD–DNN is 91%, fault diagnosis accuracy of VMD-CNN is 93%, fault diagnosis accuracy of VMD–LSTM is 92%, fault diagnosis accuracy of EMD–ELM is 90%,fault diagnosis accuracy of WT–ANN is 86.5%.
Table 2. Comparison of the diagnosis accuracies among RVMD–CCGTELM, VMD–TELM, VMD–DNN, VMD–CNN, VMD–LSTM, EMD–ELM and WT–ANN in the case of white noise interference.
Method | The number of the samples with correct diagnosis | Accuracy (%) |
|---|---|---|
RVMD–CCGTELM | 197 | 98.5 |
VMD–TELM | 186 | 93 |
VMD–DNN12 | 182 | 91 |
VMD–CNN | 186 | 93 |
VMD–LSTM | 184 | 92 |
EMD–ELM | 180 | 90 |
WT–ANN10 | 173 | 86.5 |
Fault diagnosis accuracy of RVMD–CCGTELM increases by 5.5% compared to VMD–TELM, by 7.5% compared to VMD–DNN, by 5.5% compared to VMD–CNN,by 6.5% compared to VMD–LSTM, by 8.5% compared to EMD–ELM,and by 12% compared to WT–ANN.
These results indicate that RVMD–CCGTELM achieves higher accuracy in fault diagnosis compared with VMD–TELM, VMD–DNN, VMD–CNN, VMD–LSTM, EMD–ELM and WT–ANN. The experimental findings confirm that RVMD–CCGTELM is well-suited for fault diagnosis of gearbox.The experimental findings confirm that RVMD–CCGTELM is well-suited for fault diagnosis of gearbox, even in the presence of white noise interference.
The fault diagnosis results for the gearbox using RVMD–CCGTELM, VMD–TELM, VMD–DNN, VMD–CNN, VMD–LSTM, EMD–ELM and WT–ANN under impulsive noise interference are illustrated. As summarized in Table 3, fault diagnosis accuracy of RVMD–CCGTELM is 98%, fault diagnosis accuracy of VMD–TELM is 92%, fault diagnosis accuracy of VMD–DNN is 90%, fault diagnosis accuracy of VMD–CNN is 91%, fault diagnosis accuracy of VMD–LSTM is 91.5%, fault diagnosis accuracy of EMD–ELM is 88%, fault diagnosis accuracy of WT–ANN is 85%.
Table 3. Comparison of the diagnosis accuracies among RVMD–CCGTELM, VMD–TELM, VMD–DNN, VMD–CNN, VMD–LSTM, EMD–ELM and WT–ANN in the case of impulsive noise interference.
Method | The number of the samples with correct diagnosis | Accuracy (%) |
|---|---|---|
RVMD–CCGTELM | 196 | 98 |
VMD–TELM | 184 | 92 |
VMD–DNN12 | 180 | 90 |
VMD–CNN | 182 | 91 |
VMD–LSTM | 183 | 91.5 |
EMD–ELM | 176 | 88 |
WT–ANN10 | 170 | 85 |
Fault diagnosis accuracy of RVMD–CCGTELM increases by 6% compared to VMD–TELM, by 8% compared to VMD–DNN,by 7% compared to VMD–CNN,by 6.5% compared to VMD–LSTM,by 10% compared to EMD–ELM,and by 13% compared to WT–ANN.
These results indicate that RVMD–CCGTELM achieves higher accuracy in fault diagnosis compared with VMD–TELM, VMD–DNN, VMD–CNN, VMD–LSTM, EMD–ELM and WT–ANN. The experimental findings confirm that RVMD–CCGTELM is well-suited for fault diagnosis of gearbox.The experimental findings confirm that RVMD–CCGTELM is well-suited for fault diagnosis of gearbox, even in the presence of impulsive noise interference.
Conclusions
This paper proposes a novel fault diagnosis method for the gearbox utilizing robust variational mode decomposition (RVMD) and twin extreme learning machine (TELM) optimized by chaotic composite grey wolf optimization (CCGWO). The key contributions of this paper are as follows: (1) Since the time–frequency image features derived from robust variational mode decomposition are more distinct than those derived from traditional variational mode decomposition, RVMD is employed to decompose the vibration signals of the gearbox and generate corresponding time–frequency images. (2) Twin ELM is constructed by stacking multiple layers of extreme learning auto-encoders, enabling it to extract higher-level features and achieve superior classification accuracy than extreme learning machines. Additionally, CCGWO is utilized to optimize the kernel parameters of TELM and enhance its performance. Consequently, the proposed twin extreme learning machine with CCGWO (CCGTELM) is applied to fault diagnosis for gearbox. The experimental results demonstrate that RVMD–CCGTELM is highly effective and suitable for fault diagnosis for gearbox of station wagon.
Acknowledgements
This work was supported by 1.Natural Science Foundation of Hainan province (No. 722RC719); 2. Key Laboratory of Island Tourism Resource Data Mining and Monitoring,Ministry of Culture and Tourism (KLITRDMM 2024-5); 3. Scientific Research Foundation of Hainan Tropical Ocean University (No. RHDRCSK202411).
Author contributions
All authors wrote and reviewed the manuscript.
Data availability
The data that support the findings of this study are available from the corresponding author upon request.
Declarations
Competing interests
The authors declare no competing interests.
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
1. Vashishtha, G; Chauhan, S; Sehri, M; Zimroz, R; Dumond, P; Kumar, R; Gupta, MK. A roadmap to fault diagnosis of industrial machines via machine learning: A brief review. Measurement; 2025; 242, 116216.
2. Chauhan, S; Vashishtha, G; Zimroz, R; Kumar, R. A crayfish optimised wavelet filter and its application to fault diagnosis of machine components. Int. J. Adv. Manuf. Technol.; 2024; 135, pp. 1825-1837.
3. Meserkhani, A; Jafari, SM; Rahi, A. Experimental comparison of acoustic emission sensors in the diagnosis of outer race defect of angular contact ball bearings by artificial neural network. Measurement; 2021; 168, 108198.
4. Verma, L; Kremer, F; Chevalier-Jabet, K. Defective PWR fuel rods diagnosis and characterization using an artificial neural network. Prog. Nucl. Energy; 2023; 160, 104686.1:CAS:528:DC%2BB3sXntl2gtL4%3D
5. Al-Athel, KS; Alhasan, MM; Alomari, AS; Arif, AFM. Damage characterization of embedded defects in composites using a hybrid thermography, computational, and artificial neural networks approach. Heliyon; 2022; 8, e10063. [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/35991970][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9389177]
6. Asteris, PG; Armaghani, DJ; Hatzigeorgiou, GD; Karayannis, CG; Pilakoutas, K. Predicting the shear strength of reinforced concrete beams using artificial neural networks. Comput. Concr.; 2019; 24, pp. 469-488.
7. Armaghani, DJ; Asteris, PG. A comparative study of ANN and ANFIS models for the prediction of cement-based mortar materials compressive strength. Neural Comput. Appl.; 2021; 33, pp. 4501-4532.
8. Dogan, M; Taspinar, YS; Cinar, I; Kursun, R; Ozkan, IA; Koklu, M. Dry bean cultivars classification using deep cnn features and salp swarm algorithm based extreme learning machine. Comput. Electron. Agricult.; 2023; 204, 107575.
9. Asteris, PG; Mamou, A; Ferentinou, M; Tran, TT; Zhou, J. Predicting clay compressibility using a novel Manta ray foraging optimization-based extreme learning machine model. Transp. Geotechn.; 2022; 37, 100861.
10. Jian-Da, Wu; Chan, J-J. Faulted gear identification of a rotating machinery based on wavelet transform and artificial neural network. Expert Syst. Appl.; 2009; 36, pp. 8862-8875.
11. Li, F; Pang, X; Yang, Z. Motor current signal analysis using deep neural networks for planetary gear fault diagnosis. Measurement; 2019; 145, pp. 45-54.2019Meas.145..45L
12. Li, Y; Cheng, G; Liu, C; Chen, X. Study on planetary gear fault diagnosis based on variational mode decomposition and deep neural networks. Measurement; 2018; 130, pp. 94-104.2018Meas.130..94L
13. Priyadharshini, N; Selvanathan, N; Hemalatha, B; Sureshkumar, C. A novel hybrid extreme learning machine and teaching–learning-based optimization algorithm for skin cancer detection. Healthc. Analyt.; 2023; 3, 100161.
14. Khan, MA; Arshad, H; Khan, WZ; Alhaisoni, M; Tariq, U; Hussein, HS; Alshazly, H; Osman, L; Elashry, A. HGRBOL2: Human gait recognition for biometric application using Bayesian optimization and extreme learning machine. Fut. Generat. Comput. Syst.; 2023; 143, pp. 337-348.
15. Alzaqebah, A; Aljarah, I; Al-Kadi, O. A hierarchical intrusion diagnosis system based on extreme learning machine and nature-inspired optimization. Comput. Secur.; 2023; 124, 102957.
16. Silva, MM, Jr; Simas Filho, EF; Farias, PC; Albuquerque, MC; Silva, IC; Farias, CT. Intelligent embedded system for decision support in pulsed eddy current corrosion detection using extreme learning machine. Measurement; 2021; 185, 110069.
17. Silva, LC; Simas Filho, EF; Albuquerque, MC; Silva, IC; Farias, CT. Segmented analysis of time-of-flight diffraction ultrasound for flaw detection in welded steel plates using extreme learning machines. Ultrasonics; 2020; 102, 106057.1:CAS:528:DC%2BB3cXptlarsA%3D%3D [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/31952796]
18. Fang, M; Zhang, F et al. The influence of variational mode decomposition on LSTM prediction accuracy: A case study with wind turbine power signals. Renew. Energy; 2025; 245, 122863.
19. Chen, J; Li, S et al. Significant wave height prediction based on variational mode decomposition and dual network model. Ocean Eng.; 2025; 323, 120533.
20. Liu, Z; Zhang, Z et al. Prediction of landfill gases concentration based on grey wolf optimization-support vector regression during landfill excavation process. Waste Manage.; 2025; 198, pp. 128-136.1:CAS:528:DC%2BB2MXltlCqtrs%3D
21. Yin, W; Jia, M et al. Advanced power curve modeling for wind turbines: A multivariable approach with SGBRT and grey wolf optimization. Energy Convers. Manage.; 2025; 332, 119680.
© The Author(s) 2025. This work is published under http://creativecommons.org/licenses/by-nc-nd/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.