1. Introduction
Aeromagnetic surveying, as one of the main ways to explore and survey geological resources, is an important aerogeophysical prospecting method [1,2]. Along with the miniaturization of sensors, aeromagnetic measurement based on UAVs is becoming more widely used. However, data obtained by a magnetic sensor on an aircraft contain magnetic interference information, and the effect of compensation for aeromagnetic interference directly determines the quality of aeromagnetic data [3]. Tolles and Lawson established the T-L model based on magnetic interference and aircraft attitude [4]. Based on this model, Leliak designed a compensation equation enabling the elimination of magnetic interference by solving for a compensation coefficient [5]. Leach regarded magnetic compensation as a linear regression problem and proposed a ridge regression (RR) algorithm to solve the multicollinearity problem in the T-L equation [6]. Wu et al. used principal component analysis (PCA) to reduce the multicollinearity of the T-L model [7].
For the aeromagnetic compensation method based on linear regression, if the aeromagnetic data have a complex or nonlinear structure, the algorithm is unstable, and the fitting ability is poor due to the correlation between the data, which affects the quality of aeromagnetic data [8]. Deep learning is based on the representation learning of data and has a strong fitting ability. Williams proposed the use of neural network modeling for aeromagnetic compensation and established a neural network model of aircraft interference [9]. However, due to the limited computing power available at the time, the model has not been widely used. Zhang analyzed the aircraft swaying interference not considered in the T-L model and used a one-dimensional convolutional neural network to remove the effect of tail-boom swing [10]. Xu established a binary classification network for magnetic anomaly detection and a regression network for geomagnetic noise suppression and applied deep learning to magnetic anomaly detection and noise elimination [11].
However, in the backpropagation process of neural networks, owing to factors such as the chain rule, the gradient will disappear, affecting the accuracy of aeromagnetic compensation. Yu used an autoencoder network for aeromagnetic compensation to extract data features by searching the direction of maximum change in the backpropagation process, thereby reducing the multicollinearity of the model. However, in the training process of the autoencoder network, the feature extraction of high-dimensional complex data is not perfect, the gradient is prone to fall into the local minimum, and the problem of gradient disappearance is not considered [12]. Wang considered the gradient disappearance problem in the autoencoder network training and introduced a residual connection into the autoencoder network. The features output from each layer of the encoder are transmitted to the corresponding layer of the decoder through residual connections. The algorithm is applied to fault diagnosis of generator sets [13]. Ma combined a linear model and neural network model and proposed a dual estimation method for aeromagnetic compensation [14]. Although this method can suppress the noise caused by the overfitting problem of the neural network, owing to the excessive number of iterations, the gradient will approach zero infinitely, and the gradient of the network will disappear, making it difficult to reduce the loss function, thus affecting the compensation effect. Yu considered the main factors that affect the generalization ability of the network, analyzed the probability density function that constitutes the regression neural network, and cited the fully connected neural network for aeromagnetic compensation, which solved the problem of overfitting the network and improved the generalization ability of the network, although the process did not consider the problem of gradient disappearance [15].
To address the gradient disappearance problem of neural networks in aeromagnetic compensation algorithms, in this paper, we combine a residual connection with neural networks and propose a neural network aeromagnetic compensation algorithm (Res-Bp). The features output from the low-level information of the network are transmitted to the high-level information through the residual connection without intermediate operation to realize the cross-layer propagation of the gradient [16]. Residual connection changes the structure of the network and makes it have the ability of identification mapping. It effectively retains the gradient characteristics during backpropagation, alleviates the gradient disappearance problem of traditional neural networks to a certain extent, and improves the accuracy of aeromagnetic compensation. The algorithm is applied to the data collected by UAVs to verify its effectiveness. The remainder of this article is organized as follows. In Section 2, we briefly introduce the T-L model and analyze the reason for the gradient disappearance of the primitive neural network, as well as the principle of the algorithm proposed in this paper. In Section 3, we apply real compensation flight data to conduct experiments and analysis.
2. Compensation Model and Method
2.1. T-L Model
The magnetic interference model of aircraft includes three components: the permanent () magnetic field generated by various parts of the aircraft, the induced () magnetic field produced by geomagnetic field magnetization, and the eddy-current () magnetic field produced by aircraft motion, which can be expressed as:
(1)
(2)
(3)
where are compensation coefficients; are the derivatives of , respectively; and is the Earth’s magnetic field. can be calculated based on three-axis fluxgate magnetometer data as:(4)
The total magnetic interference field can be expressed as:
(5)
The LS solution of the T-L equation is:
(6)
(7)
where is a column vector, consists of(), and:(8)
where are the variables in Equation (5), and the column vector of has multicollinearity.When the independent variable has multicollinearity (), due to the singularity of , there is a strong correlation between variables, which results in a considerable distance between the estimated and true values. There is a deviation in the LS algorithm, which affects the compensation accuracy. Ridge regression (RR) [16] is based on LS, adding an identity matrix to the coefficient matrix to improve stability. Its solution is expressed as:
(9)
where is a positive constant, and is the identity matrix.2.2. Primitive Neural Model
The neural network structure for aeromagnetic compensation includes an input, hidden, and output layer. The input training samples of the network are . The nodes of the neural network are connected by , where is the bias term of layer , and is the weight between layers and .
In the process of forward propagation, the features of layer are:
(10)
where is the number of nodes in layer 2, and is the number of nodes in the input layer, is the activation function, and the features of layer are:(11)
where is feature of layer , and is the weight between node of layer and node of layer .To improve the speed of training convergence, in this paper, we select a ReLU activation function and MSE as the loss function, expressed as:
(12)
(13)
where is the true value, and is the predicted value.In the process of backpropagation, the gradient descent algorithm is used to calculate the gradient of function on each parameter through the derivative chain rule, the global optimal parameters are obtained, and the parameters are updated according to the gradient [17]. Taking two hidden layers as an example, the gradient process of function to is:
(14)
(15)
When one of the terms in Equation (15) is less than 1, with an increase in the number of layers, the gradient update information decays exponentially, so the shallow network parameters cannot be updated. Therefore, the neural network exhibits the phenomenon of gradient disappearance, which leads to an insufficient fitting ability of the model, thus affecting the accuracy of aeromagnetic compensation.
2.3. Res-Bp
To address the problem of gradient disappearance in the aeromagnetic compensation process of the neural network, we introduce the idea of residual connection and propose an algorithm with strong fitting ability, Res-Bp. The core of residual connection is the data fusion of nonlinear and linear features [18]. In forward propagation, the input signal can be directly propagated from any low level to a high level and has identity-mapping ability.
As shown in Figure 1, the residual connection has an identity-mapping structure compared to the ordinary connection. That is, the output () of a layer is added to the new information () of the layer after nonlinear transformation, and the sum result is input into the activation function as the output of the next layer.
In the process of forward propagation, assuming that is a linear activation function, the features of layer are:
(16)
(17)
where represents the nonlinear transformation function.Equations (16) and (17) represent the training process from layer to layer in the forward propagation process of ordinary and residual connections, respectively. Residual connection layer can be expressed as the sum of any layer ( > ) and the residual parts between them, which shows that the residual network always retains the original information in the process of training and obtains new information in the network. The training of an ordinary connection from layer to layer can only be formed through the training iteration of each layer between them.
When the dimensions of input and output are inconsistent, a weight () is necessary to adjust the dimension of to attain consistency. It realizes the residual operation of tensors between different dimensions and jumps out of the limitation of dimensions. It is expressed as:
(18)
In the process of backpropagation, the gradient of loss to is:
(19)
(20)
Equations (19) and (20) represent the gradient process of finding function pair parameters in the backpropagation process of ordinary and residual connections, respectively. When the residual connection is backpropagation, the high-level information can be directly propagated to the low level without the transformation of the intermediate weight matrix [19]. Even if there is gradient attenuation between layers and , the gradient of layer can still be directly transmitted to layer , realizing the cross-layer propagation of the gradient, which can solve the problem of gradient disappearance and improve the accuracy of aeromagnetic compensation. When ordinary connections are backpropagating, gradient information cannot be effectively propagated to the lower layer, so the gradient disappears.
In this paper, the T-L equation is used as the coefficient matrix is the sample size) and as the input parameter of the neural network model. The total () magnetic interference field is used as the output parameter of the model. The Res-Bp structure is as shown in Figure 2. Suppose there are two sets of data: a training set () and a test set (). After the training set is trained by the network model (), the predicted interference field of the test set is:
(21)
The compensated aeromagnetic data are obtained by subtracting the predicted value () from the real value ().
The improvement ratio () is a measurement scale proposed by the Canadian Aviation Research Center (NAE). At present, this measurement scale is adopted internationally and can reflect the performance of the aeromagnetic measurement system [20,21]. Therefore, the standard deviation () and of the aeromagnetic data before and after compensation are used to evaluate the magnetic interference compensation algorithm:
(22)
(23)
where is the arithmetic mean of the variable, and and are the standard deviations of the uncompensated and compensated data, respectively.3. Experiment
3.1. Compensation Flight
To verify the feasibility of the method, we used an unmanned aerial vehicle (UAV) to perform a figure-of-merit (FOM) flight to obtain aeromagnetic data. As shown in Figure 3, the UAV platform is equipped with a three-axis fluxgate magnetometer to measure the three components of the geomagnetic field and a potassium pump magnetometer to measure the total geomagnetic field.
The FOM flight [22] includes maneuvers in four directions, each of which includes three ±5° yaws, three ±5° pitches, and three ±10° rolls. The flight altitude is 50 m, the duration of each maneuver is 10 s, the sampling rate is 10 Hz, and the flight duration is 8–10 min. The FOM flight path is shown in Figure 4, including flight A and flight B. The aeromagnetic interference data of the two flights are shown in Figure 5.
3.2. Compensation Experiment
To determine whether Res-Bp can effectively solve the gradient disappearance problem and improve compensation accuracy, we performed compensation experiments with the data of flights A and B. First, we used the data of flights A and B as the test set and training set, respectively, which are compensated by linear regression methods LS and RR.
The results shown in Figure 6 and Table 1 show that during flight A, LS reduces STD from 2.171 to 0.355, and IR is 6.115. RR reduces STD to 0.324 and IR to 6.701, so the fitting ability of LS is weaker than that of RR, which is a biased estimation regression method for collinear data analysis. By giving up the unbiasedness of LS, the regression coefficient is obtained at the cost of some information loss and reduced accuracy. It has a higher fitting accuracy and stronger stability than LS.
We used the data of flights A and B as the training set and test set, respectively, to compensate flight B and flight A. The compensation results are shown in Figure 7 and Figure 8. Table 2 compares the results of Res-Bp and neural networks (BP). In flight B, Res-Bp reduces STD from 2.334 to 0.260, and IR is 8.977. In flight A, Res-Bp reduces STD from 2.112 to 0.266, and IR is 7.940. Compared with the linear regression method (Table 1), Res-Bp has a stronger compensation ability for complex and nonlinear data. Compared with the neural network, Res-Bp improves the compensation accuracy and fitting ability.
Because the parameter update is related to the gradient of the function, the larger the gradient, the faster the parameter update, which is conducive to the convergence of the network. Figure 9 and the curve (Figure 10a,b) show that the gradient of Res-Bp is larger than that of BP, the convergence speed of Res-Bp is faster than that of BP, and the algorithm is more stable, demonstrating that Res-Bp effectively alleviates the gradient disappearance problem and improves the fitting ability of the network. However, as the network descends along the direction of negative gradient, the principle of weight adjustment is to take the local optimum at each step, leading to the local minimum (Figure 11), and the global optimum solution cannot be obtained. Figure 10 shows that BP and Res-Bp have local minimum values, preventing the network from reaching the expected optimal state, resulting in unstable performance.
The decisive coefficient (R2) measures the goodness of fit [23] and is expressed as:
(24)
where is the real value, is the predicted value, and is the mean real value.We used R2 to measure the performance of the model. The closer R2 is to 1, the better the fitting degree of the regression line to the real value and the better the model’s performance. Therefore, according to the R2 curve (Figure 10c,d), the fitting degree of Res-Bp is stronger than that of the neural network, and it converges faster.
4. Discussion
We demonstrated the effectiveness of combining a residual connection with a neural network in Res-Bp to improve aeromagnetic compensation accuracy. However, some magnetic interference still cannot be fully compensated, with many possible explanations. First, during the FOM flight, the flight control system of the UAV should be manually controlled, using a low-altitude flight to obtain aeromagnetic data. However, due to the influence of control and real-time meteorological factors, the flight attitude of the UAV is unstable and cannot fully comply with the FOM flight. An unstable flight attitude aggravates the multicollinearity of the model and affects the accuracy of the compensation model. Second, the number of training sets may not be sufficient. Multiple FOM flights can be carried out, and multiple flight dataset can be used as a training set to further train the network to improve its generalization ability in order to improve compensation accuracy. Third, it may be affected by factors such as the activation and loss functions in the model. Fourth, there are local minima in Res-Bp and BP in the training process, which affect the fitting ability of the network and therefore the quality of aeromagnetic data. In a future study, the compensation model will be further improved.
5. Conclusions
Based on the linear regression method used to eliminate magnetic interference, problems such as insufficient fitting ability and an unstable algorithm are encountered. Although the neural network is a nonlinear function approximation network with good performance, it’s fitting ability is better than that of linear regression and it is widely used in aeromagnetic compensation owing to the effect of gradient backpropagation and the inevitable problem of gradient disappearance on the fitting ability. Because the FOM flight of the UAV is not standardized, there will be noise in the aeromagnetic data. Even after normalization, the data still contain problems, leading to gradient problems in the network. To improve compensation accuracy, we proposed Res-Bp, which is a neural network fitting compensation algorithm based on the T-L model. By combining residual connection with a neural network, the output of a layer can directly cross several layers as the input of a later layer, which considerably improves the integrity of gradient information in the backpropagation process. Experiments with several groups of FOM flight data show that the algorithm is effective. The results show that Res-Bp has better aeromagnetic compensation ability than traditional neural networks, effectively retaining the gradient in the backpropagation process and avoiding the problem of gradient disappearance. Additionally, the convergence process of the network is accelerated, the propagation of features is strengthened, and the fitting ability of the network and the accuracy of aeromagnetic compensation are improved.
P.Y., J.J. and F.B.: Conceptualization, Methodology, Software; J.J. and S.Z.: Resources Data curation; F.B.: Writing-Original draft preparation; F.B. and Z.S.: Visualization; J.J., F.B. and X.Z.: Writing-Reviewing and Editing. All authors have read and agreed to the published version of the manuscript.
The authors declare no conflict of interest.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Figure 1. Connection mode of a neural network: (a) ordinary connected structure; (b) residual connected structure.
Figure 4. The FOM flight path. The square represents the beginning of the flight, and the diamond represents the end of the flight.
Figure 5. Magnetic interference during flight: (a) magnetic interference during flight A; (b) magnetic interference during flight B.
Figure 9. The gradient value of the same parameter during Res-Bp and BP training.
Figure 10. Res-Bp and BP loss and R2: (a) compensation loss of flight B data; (b) compensation loss of flight A data; (c) compensation R2 of flight B data; (d) compensation R2 of flight A data.
Figure 10. Res-Bp and BP loss and R2: (a) compensation loss of flight B data; (b) compensation loss of flight A data; (c) compensation R2 of flight B data; (d) compensation R2 of flight A data.
Comparison of compensation results of LS and RR.
Test Set | Training Set | Method | STDu | STDc | IR |
---|---|---|---|---|---|
Flight A | Flight B | LS | 2.171 | 0.355 | 6.115 |
RR | 0.324 | 6.701 |
Comparison of compensation results of BP and Res-Bp.
Test Set | Training Set | Method | STDu | STDc | IR |
---|---|---|---|---|---|
Flight B | Flight A | BP | 2.334 | 0.294 | 7.939 |
Res-Bp | 0.260 | 8.977 | |||
Flight A | Flight B | BP | 2.112 | 0.285 | 7.411 |
Res-Bp | 0.266 | 7.940 |
References
1. Xiong, S.Q. Technological innovation and application of aerogeophysical exploration. J. Geomech.; 2020; 26, pp. 791-818.
2. Meng, Q.K.; Zhou, D.W. History and Prospect of aeromagnetic compensation technology at home and abroad. Geophys. Geochem. Explor.; 2017; 41, pp. 694-699.
3. Noriega, G. Aeromagnetic compensation in gradiometry—Performance, model stability, and robustness. IEEE Geosci. Remote Sens. Lett.; 2014; 12, pp. 117-121. [DOI: https://dx.doi.org/10.1109/LGRS.2014.2328436]
4. Tolles, W.E.; Lawson, J.D. Magnetic Compensation of MAD Equipped Aircraft; Airborne Instruments Lab. Inc.: Mineola, NY, USA, 1950; 201.
5. Leliak, P. Identification and evaluation of magnetic-field sources of magnetic airborne detector equipped aircraft. IRE Trans. Aerospace Navig. Electr.; 1961; 3, pp. 95-105. [DOI: https://dx.doi.org/10.1109/TANE3.1961.4201799]
6. Leach, B.W. Aeromagnetic compensation as a linear regression problem. Information Linkage between Applied Mathematics and Industry II; Academic Press: London, UK, 1980; pp. 139-161.
7. Wu, P.L.; Zhang, Q.Y. Aeromagnetic compensation algorithm based on principal component analysis. J. Sens.; 2018; 2018, 7. [DOI: https://dx.doi.org/10.1155/2018/5798287]
8. Zhao, X.; Yu, P.; Jiao, J. A New Improved Algorithm for Aeromagnetic Compensation. IOP Conf. Ser.: Earth Environ. Sci.; 2021; 660, 012132. [DOI: https://dx.doi.org/10.1088/1755-1315/660/1/012132]
9. Williams, P.M. Aeromagnetic compensation using neural networks. Neural Comput. Appl.; 1993; 1, pp. 207-214. [DOI: https://dx.doi.org/10.1007/BF01414949]
10. Zhang, D.K. Analysis of Aeromagnetic Swing Noise and Corresponding Compensation Method. IEEE Trans. Geosci. Remote Sens.; 2021; 60, pp. 1-10. [DOI: https://dx.doi.org/10.1109/TGRS.2021.3095564]
11. Xu, X.; Huang, L.; Liu, X. DeepMAD: Deep Learning for Magnetic Anomaly Detection and Denoising. IEEE Access; 2020; 8, pp. 121257-121266. [DOI: https://dx.doi.org/10.1109/ACCESS.2020.3006795]
12. Zhao, X.; Jiao, J. An aeromagnetic compensation algorithm based on a deep autoencoder. IEEE Geosci. Remote Sens. Lett.; 2020; 19, 3044999.
13. Wang, Z. Fault Early Warning of Wind Turbine Generator based on Residual Autoencoder Network. Proceedings of the 6th International Conference on High Performance Compilation, Computing and Communications; Jilin, China, 23–25 June 2022.
14. Ma, M.; Zhou, Z.; Cheng, D. A dual estimate method for aeromagnetic compensation. Meas. Sci. Technol.; 2017; 28, 115904. [DOI: https://dx.doi.org/10.1088/1361-6501/aa883b]
15. Yu, P.; Zhao, X.; Jiao, J. An improved neural network method for aeromagnetic compensation. Meas. Sci. Technol.; 2021; 32, 045106. [DOI: https://dx.doi.org/10.1088/1361-6501/abd1b4]
16. He, K.; Zhang, X.; Ren, S. Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition; Las Vegas, NV, USA, 27–30 June 2016; pp. 770-778.
17. Jiao, J.; Yu, P.; Zhao, X. Real-Time Aeromagnetic Compensation With Compressed and Accelerated Neural Networks. IEEE Geosci. Remote Sens. Lett.; 2022; 19, pp. 1-5. [DOI: https://dx.doi.org/10.1109/LGRS.2022.3142007]
18. Jastrzębski, S.; Arpit, D.; Ballas, N. Residual connections encourage iterative inference. arXiv; 2017; arXiv: 1710.04773
19. He, K.; Zhang, X.; Ren, S. Identity mappings in deep residual networks. European Conference on Computer Vision; Springer: Cham, Switzerland, 2016; pp. 630-645.
20. Hardwick, C.D. Important design considerations for inboard airborne magnetic gradiometers. Geophysics; 1984; 49, pp. 2004-2018. [DOI: https://dx.doi.org/10.1190/1.1441611]
21. Noriega, G. Performance measures in aeromagnetic compensation. Leading Edge; 2011; 30, pp. 1122-1127. [DOI: https://dx.doi.org/10.1190/1.3657070]
22. Zhao, X.; Zheng, X.C.; Jiao, J. Fast Processing Method of Aeromagnetic Compensation Basedon Fluxgate Estimation. J. Jilin Univ. (Earth Sci. Ed.); 2019; 49, pp. 857-864.
23. Li, J.J.; Zhang, J.T. Study on the determinacy of the determinate coefficient of regression model. Stat. Decision Making; 2005; 11, pp. 19-20.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
Aeromagnetic compensation is a crucial step in the processing of aeromagnetic data. The aeromagnetic compensation method based on the linear regression model has poorer fitting capacity than the neural network aeromagnetic compensation algorithm. The existing gradient updating neural network-based aeromagnetic compensation algorithm is subject to the problem that the gradient disappears during the backpropagation process, resulting in poor fitting ability and affecting aeromagnetic compensation accuracy. In this paper, we propose a neural network compensation algorithm with strong fitting ability: residual backpropagation neural network (Res-bp). The algorithm realizes the cross-layer propagation of the gradient through a residual connection so that the network not only preserves the original information but also acquires additional information during training, successfully solving the problem of gradient disappearance and boosting the network’s fitting capacity. The algorithm is applied to the data collected by unmanned aerial vehicles (UAVs) to verify its effectiveness. The results show that the improvement ratio is improved compared with the traditional neural network, demonstrating that the algorithm has a significant compensation effect on aeromagnetic interference and improves the quality of aeromagnetic data.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer