Full Text

Turn on search term navigation

This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

1. Introduction

Unmanned surface vehicles (USVs) are mainly preferred by missions that are characterized as dull, dangerous, or ill-suited for manned ships. In the future, they will be developed for ocean mapping, hydrographic and meteorological monitoring, maritime search and rescue, etc. Autonomous control is the core technology of USV navigation. It belongs to motion control technology, which includes set-point regulation control [1], path following control [2], and trajectory tracking control.

Trajectory tracking is defined as the control for actual track of vehicle so that the vehicle can track the position of Cartesian coordinate relative to time [3]. The optimal time-varying path for trajectory tracking is derived from the dynamic model of the vehicle and a predefined target [4]. The difficulty is the impact of uncertain sea environment. Trajectory tracking is a fundamental capability for USV to perform missions such as automatic collision avoidance and cooperative formation. Therefore, we choose trajectory tracking control as the research issue.

This study follows the framework of the Guidance-Navigation-Control (GNC) system to solve the trajectory tracking problem of USV. The GNC system framework is a two-stage process consisting of guidance and control [4]. The guidance process refers to the transformation of a vehicle position to its heading and speed in the kinematics domain by means of guidance law. The control process refers to the transformation of heading to rudder angle and speed to throttle in the kinetics domain by means of control law. The essence of trajectory tracking control is to minimize the position error between the actual track and the reference track. Therefore, the performance of trajectory tracking is mostly dependent on the guidance process [5].

The object of this study is the guidance process of trajectory tracking control for USV. The modeling approach and experimental platform are founded on Mission Oriented Operating Suite Interval Programming (MOOS-IvP) [6]. MOOS-IvP is a research platform for autonomous maritime vehicles, open-sourced by MIT. It conforms to the framework of GNC system and divides guidance and control processes. The waypoint behavior in MOOS-IvP models the process of trajectory tracking, which is a Line-of-Sight (LOS) guidance method. The feedback effect generated by control law, vehicle, and its interaction with the environment is regarded as a whole. Trajectory tracking control of USV is accomplished by adjusting the LOS guidance parameters. MOOS-IvP can be deployed in real ships, so our work also enables its application to real-world scenarios.

On the other hand, the dramatic technological evolution of deep learning has delivered new insights and approaches for the study of USV autonomous navigation. Deep neural network (DNN) recognizes and extracts the relationship between the combined features. It is suitable for uncertain sea environment and complex vehicle motion control process [7]. In this paper, a dual-DNN model is established, and back propagation method is used to train the USV by taking the navigation data samples under different parameters such as speed and steering angle in a simulated environment. The trained DNN model is capable of predicting the tracking effect and estimating better guidance law parameters, so as to improve the trajectory tracking control process of USV.

During the last decade, a large number of methods for trajectory tracking control have been developed, but in practice there are still many difficulties. Velagic et al. proposed an adaptive fuzzy controller [8] and built a ship dynamics model, a steering equipment model, and a wind-flow disturbance model. All models have been simplified, including reducing dimensions, adding constraints, and removing higher order interference terms. On the other hand, the model parameters were adjusted according to predefined 49 fuzzy rules. If the ship encountered conditions that did not appear in the rules, it would be difficult to control them precisely. Aguiar et al. presented a nonlinear control algorithm according to Lyapunov theory and proved the global convergence of the model without constraints [9]. Lv et al. proposed a hybrid cooperative signal energy control law, which dealt with the speed and course control of ship. However, they did not consider the uncertainty of resistance and disturbance [10]. Xia et al. developed a dynamic model and an adaptive controller to promote tracking performance and convergence speed, but they did not consider the parametric perturbations caused by power devices [11]. Huang et al. decomposed the trajectory tracking problem into guidance law and control law loops, which are easier to solve than direct control methods, but they did not take into account external disturbances from the marine environment [12]. The traditional methods adopted in these studies aimed to build complex models with many parameters, and most of them did not consider the impact brought by real environment and equipment.

In recent years, many studies have used artificial neural network (ANN) in the control issues of USV, especially in trajectory tracking control problems. ANN and data-driven machine learning method have shown success in the analysis of uncertain and environmentally sensitive ship motion control problems [13, 14]. Cheng et al. used ANN as a replacement model for the traditional model to solve the problem of ship berthing [15]. Shuai et al. constructed two ANNs to extract features for controlling the ship propeller and rudder, respectively, to achieve automatic berthing under different environmental disturbances [16]. Zhang et al. proposed an adaptive robust ANN method for modeling uncertain ship dynamics and external influences and achieved good results in automatic ship berthing [17]. The ANN proposed by Wang et al. was used to control the course of USV and solve the problem of uncertainty in ship motion [18]. In the above literature works, the influence of sea environment has been considered, and the ANN method has been improved over traditional methods. However, since the ANN is shallow, it is difficult to learn the characteristic relationships of the parameters, so they selected a simple scenario of berthing or simplified the problem.

In recent years, DNN has demonstrated unprecedented ability in the field of traffic and control [19]. Kim et al. exploited DNN-based feedback controllers to compensate for the disturbance of curved road and reduced tracking error in lane keeping [20]. Xu et al. used DNN to learn the complex manipulation characteristics of USV based on the visual system [21]. Chen et al. proposed a DNN-based data-driven control method that greatly improved the capability and accuracy of control systems [22]. Deep belief network was proposed by Tan et al. to address the navigational safety of unmanned aerial vehicles [23]. It can be seen that DNN has shown stronger feature learning ability than shallow ANN [24]. However, it is not widely used in the trajectory tracking control problem of USV.

DNN has a gradient dispersion problem, which is generally solved by “pretraining and fine-tuning” method. Hinton et al. proposed a method of pretraining the restricted Boltzmann machine (RBM) in each layer, followed by fine-tuning the DNN [25]. Goudarzi et al. proposed two stacked RBMs for predicting short-term traffic flow by using the “pretraining and fine-tuning” method [26]. Zhao et al. developed a deep belief network consisting of several RBMs stacked, to reduce the risk of vehicle collision in snow and ice conditions on highways [27]. Pretraining of DNNs by using RBM is a common method [25]. However, RBM has been originally developed for binary vector modeling, where both visible and hidden layer variables are binary. In this study, the visible layer variables are types of continuous values, so binary RBM cannot be sufficiently used. Yamashita [28] gave a method for continuous value type vectors, i.e., Gaussian–Bernoulli restricted Boltzmann machine (GB-RBM), so we use GB-RBM for pretraining DNN to improve the performance, and make it applicable to our research.

In the implementation of intelligent navigation systems for USV, MOOS-IvP has been very popular in academic and industrial research fields nowadays, and it provides good support for autonomous navigation [29]. Firstly, MOOS-IvP provides a simulated experimental environment for USV. For example, Dong et al. used the MOOS-IvP platform to experiment with different test items for distributed remote control of USVs [30]. Secondly, MOOS-IvP provides integrated interfaces for software development and algorithm implementation. For example, the instruction filter control module developed by Djapic et al. was integrated into MOOS-IvP [31]. In addition, a set of algorithms for generating waypoints developed by Benjamin et al. have been used for path planning in MOOS-IvP [32]. This study is inspired by them. On the one hand, MOOS-IvP is used to acquire data and perform experiments. On the other hand, the developed DNN model is integrated into MOOS-IvP. From the view of artificial intelligence computing, this is also an upgrade of MOOS-IvP platform.

In this study, a deep learning methodology is utilized to predict the parameters of the waypoint behavior in MOOS-IvP, and the DNN prediction model is implemented in parallel with the control loop to achieve the trajectory tracking of USV. The whole project is implemented in two stages: In the first stage, a classification model based on DNN was constructed to provide assistance and reference for maneuvering decisions of USV [7]. In the second stage, we regard the feedback effect generated by the control law, vehicle, and its interaction with the environment as a whole and connect DNN in parallel with the control loop of USV, so as to predict LOS guidance law parameters in real time during voyage.

To predict the LOS guidance law parameters accurately in the second stage, we add a pretraining process by using GB-RBM, which improves the accuracy of model classification to 89.9% with an increase of 5% over the previous stage.

On the basis of the above works, the “waypoint behavior effect evaluation model” and the “real-time LOS parameter valuation model” are constructed based on DNN, denoted as DNN-1 and DNN-2, respectively. They are connected in parallel with the trajectory tracking control loop of USV. In the process of voyage, DNN-1 is used to predict the effect of navigation at first, and then the LOS parameters are given by DNN-2 when the effect is not good. The new LOS parameters are configured to adjust the waypoint behavior of USV. In this way, it is not only introducing intelligent computing to the control loop but also maintaining the reliability of the traditional control process as far as possible. At the same time, it also takes into account the traditional habit of steering infrequently in the ship maneuvering. In addition, we develop a new MOOS-IvP application to perform the computing of DNN model, and establish an interface between the trained DNN and the MOOS-IvP platform.

The contribution herein mainly includes the following three aspects:

(1) With the deep learning training method of “pretraining and fine-tuning” and the model of GB-RBM, the prediction accuracy of classification model is improved. GB-RBM can fit the numerical data to prevent the model falling into local optimum.

(2) A new predictive-based trajectory tracking control model has been innovatively constructed. The model consists of two DNNs, i.e., “DNN-1: waypoint behavior effect evaluation model” and “DNN-2: real-time LOS parameter valuation model.” We connect DNNs in parallel with the control loop of USV, which can obviously improve the trajectory tracking effect.

(3) Intelligent trajectory tracking of USV is achieved by dynamically connecting the DNN model to a new application developed in MOOS-IvP. MOOS-IvP can be plugged into the real vehicle, so the application developed in this study can be employed in real maritime scenarios as well.

The rest of the paper is organized as follows: Section 2 starts with a description of the general process of trajectory tracking control based on waypoint behavior. Then, the method of connecting DNN in parallel with the control loop of USV is given. Finally, the implementation in the MOOS-IvP system architecture is illustrated and the waypoint behavior dataset is described. Section 3 first presents the overall implementation framework for training and accessing the DNN model into the control loop. Then the principle, construction, and training of the DNN model are given, and the implementation method and process of how to access the trained model into the control loop of USV are explained. In Section 4, the experimental simulation results are presented and analyzed in respect of model training, optimization effect of GB-RBM, and trajectory tracking control effect. Section 5 summarizes the paper.

2. Problem Formulation and System Architecture

2.1. General Process of Trajectory Tracking

The general process for a USV to perform trajectory tracking is to generate a set of waypoints based on the mission of the voyage. Then, the USV sequentially moves towards the waypoints and follows the planned route sailing. It can be divided into 5 phases, as shown in Figure 1:

(1) Output speed and steering orders by the guidance algorithm to guide USV towards the next waypoint

(2) Transform speed and steering orders to throttle and rudder actions

(3) Under wind, waves and current conditions, the USV navigates in the sea

(4) Send the feedback of USV speed and heading to the control module, which outputs new throttle and rudder angle actions

(5) Send the feedback of actual position of USV to the guidance module, which outputs new speed and steering orders

[figure omitted; refer to PDF]

According to the general process above, we consider the feedback effects produced by phases (2)-(4) as a whole and model the trajectory tracking control problem of USV as a waypoint behavior based on guidance algorithm.

This study is an optimization of the lookahead-based guidance algorithm by deep artificial neural networks. As shown in Figure 2(a), the lookahead-based guidance algorithm can be formulated as a geometric relationship between the vehicle, the previous waypoint, and the next waypoint [33].

[figures omitted; refer to PDF]

Primarily, the angle formed by the two waypoints is $α$ , as in $\begin{matrix} (1) & α = \tan^{- 1} y_{k + 1} - y_{k}, x_{k + 1} - x_{k} . \end{matrix}$

Then, the lead distance $l_{1}$ and lead damper $l_{2}$ can be derived from $α$ and the position of the vehicle $x_{t}, y_{t}$ , as shown in $\begin{matrix} (2) & l_{1} = x_{t} - x_{k} \cos α + y_{t} - y_{k} \sin α, \\ (3) & l_{2} = - x_{t} - x_{k} \sin α + y_{t} - y_{k} \cos α, \end{matrix}$ where $l_{1}$ is the distance between the point of LOS and the vertical foot of the vehicle to the planned route. Generally, it takes 1.5–2.5 times the length of the vehicle. $l_{2}$ is the distance from the vehicle to the vertical foot. The trajectory tracking control is to reduce the track error approaching zero by regulating the $l_{1}$ and $l_{2}$ .

In addition, there are two circles associated with trajectory tracking during the planned route involving several waypoints. The inner circle is called the capture circle, which signifies the arrival of the vehicle to a waypoint while sailing in still water. The outer circle, known as the slip circle, marks the arrival when it is affected by wind, waves, and current, as illustrated in Figure 2(b).

2.2. DNN-Based Parallel Process of Trajectory Tracking

The DNN model proposed in this paper is in parallel with the general control process described above, as shown in Figure 3. We send the data from the navigation system of vehicle to the LOS guidance module and the DNN prediction model simultaneously and send the steering angle of the waypoint to the prediction model in advance. The prediction model consists of two submodels, where DNN-1 predicts the navigational effect firstly. If it works well, there will be no adjustment of parameters. Otherwise, DNN-2 is used to predict the relevant parameters of LOS algorithm, and the lead distance and the lead damper are adjusted to indirectly control the ship navigation to achieve better waypoint behavior effect.

[figure omitted; refer to PDF]

This dual-DNN predictive model of trajectory tracking control using a parallel access approach is different from previous intelligent control models. It is not directly connected to the control loop. There are two advantages:

(1) It is based on deep learning methodology for predictive model, analogous to ship officer which does not directly change guidance law and control law of the vehicle. If the original algorithm is good, it will not affect the ship navigation. Only when the navigation effect is predicted to be bad, the guidance law parameters are adjusted to improve the navigation effect of vehicle in the kinematic.

(2) After long-term application and verification, the traditional control model is relatively reliable in engineering. The new model is parallel with the traditional model, which greatly enhances the practical value. In particular, when the navigation effect is good, DNN-2 is not involved in the control. It can also prevent frequent rudder manipulation, which is more in line with the regular mode of ship maneuvering.

2.3. System Architecture Based on MOOS-IvP

The autonomous navigation system of USV in this study adopts the system architecture of MOOS-IvP. MOOS-IvP was initially used on the Bluefin Odyssey III vehicle of MIT. The main motivation is to build high-performance autonomous systems [29]. Mission Oriented Operating Suite (MOOS) is a set of software components that provide a framework for the coordinated operation of multiple individual processes. Interval Programming (IvP) is a solution to the problems of multiobjective optimization, which is used for organizing various behaviors to achieve autonomous navigation of USV.

The system architecture of MOOS incorporates publish-subscribe middleware. Each MOOS application (MOOS app) interacts with information by connecting to a MOOS database (MOOSDB), and they form a star topology. As shown in Figure 4, the general process for trajectory tracking control of USV is implemented by a set of MOOS apps. The “pHelmIvP” app is a guidance module, “pMarinePID” app is a control module, “pNodeReport” app is a navigation module, and “uSimMarine” app is a simulation module of wind, waves, currents, and hull effects. All of them are connected to MOOSDB, constituting the autonomous navigation simulation system of USV. The DNN-based parallel control process is realized by adding a MOOS app called “pDeepLearning.” “pDeepLearning” is a new MOOS app developed in this study, which subscribes to the vehicle speed and course and publishes the predicted values given by DNN model.

[figure omitted; refer to PDF]

The guidance algorithm of USV is implemented by the instance of waypoint behavior contained in “pHelmIvP.” The configuration parameters of behavior instance correspond to the variables of guidance algorithm. These parameters are stored in a “.bhv” file and invoked during the initialization of the mission. Once the mission is launched, the “.bhv” file can no longer be modified.

In this study, it is necessary to dynamically configure USV waypoint behavior with the predictions given by DNN model. This allows the USV to adjust the guidance parameters according to the navigation state. We use the “updates” parameters to publish the variables to the MOOSDB. To do this, it is necessary to configure a WPT_UPDATE variable in the “.bhv” file, as shown in Table 1.

Table 1

Waypoint behavior configuration file.

Configured variable	Configured value
Behavior name	BHV_Waypoint
Behavior instance name	waypt_survey
Lead distance	8.0
Lead damper	5.0
Capture circle radius	7.0
Slip circle radius	14.0
Updates parameter	WPT_UPDATE

In the process of voyage, the WPT_UPDATE variable is used to publish specific content for changing parameters in the configuration file. For example, if WPT_UPDATE = “lead_distance = 6.90,” it would immediately change the lead distance value of the USV from 8.0 to 6.9 in the original configuration file.

2.4. Dataset and Statistical Analysis

In previous work, we have made the waypoint behavior dataset [7]. The training samples comprise six features that are speed and steering angle, lead distance, lead damper, capture radius, and slip radius. These features can be classified into 3 categories, which are related to the definition of waypoint, guidance algorithm, and navigation process, respectively, as shown in Table 2. The training labels are different effect levels for USV, belonging to levels I∼III; therein, corresponding times vehicle lengths are as shown in Figure 5.

Table 2

Categories and features of waypoint behavior for USV in dataset.

Number	Categories	Features
1	Related to the definition of waypoint	Capture radius
2		Slip radius

3	Related to guidance algorithm	Lead distance
4	Related to guidance algorithm	Lead damper

5	Related to navigation	Speed
6	Related to navigation	Steering angle

[figures omitted; refer to PDF]

In this study, a preliminary statistical analysis of the dataset is conducted. From the statistical analysis in Figure 6, the categorical items are evenly distributed, and no special statistical patterns can be seen. However, these parameters do affect the effectiveness of USV. It is necessary to mine them with deep learning methods.

[figures omitted; refer to PDF]

3. Methods

3.1. Dual-DNN Prediction Model

A predictive DNN model for trajectory tracking control is established. It consists of two submodels. The first submodel is DNN-1, which is used to predict the influence of waypoint behavior parameters on the drift effect. It is a “6-input-5-output” classification model, using waypoint behavior dataset for training. The second submodel is DNN-2, which is used to estimate the values of two guidance parameters: lead distance and lead damper. It is a “4-input-2-output” regression model, using the part of waypoint behavior dataset with better navigation performance for training.

The DNN model is designed to identify the complex effects of various factors and their combinations in the waypoint behavior of USV on high-dimensional spatial planes. In addition, the nonlinear effects are caused by the power unit installed in the vehicle and the effect of wind and waves on the vehicle. They have been involved in the acquisition of the data of waypoint behavior.

3.1.1. DNN-1:Waypoint Behavior Effect Evaluation Model

DNN-1 is a feedforward network with N_{6-6-7-7-8-7-6-5}, as shown in Figure 7. The input layer corresponds to six features of waypoint behavior. The activation function in hidden layer is ReLU function. The output layer corresponds to five levels of effect. The loss function is cross-entropy.

[figure omitted; refer to PDF]

The GB-RBM is a neural network based on energy. The combined energy functions of the visible and hidden variables are $\begin{matrix} (4) & E x, h = - h^{T} W \frac{x}{σ} - \frac{{x - c^{T}}^{2}}{2 σ^{2}} - b^{T} h, \end{matrix}$ where the visible layer random vector $x = {x_{1}, x_{2}, x_{3}, \dots, x_{6}}^{T}$ ; the hidden layer random vector $h = {h_{1}, h_{2}, h_{3}, h_{4}, h_{5}, h_{6}}^{T}$ ; the weight matrix $W \in R^{6 \times 6}$ , and each element $w_{j k}$ is the weight of connections between the visible layer variable $x_{k}$ and the hidden layer variable $h_{j}$ ; the bias $c \in R^{6}$ and $b \in R^{6}$ ; and $σ$ is the standard deviation associated with Gaussian visible vector $x$ .

After defining the joint energy function of $x$ and $h$ , it can get the joint probability of $x$ and $h$ , as shown in (5). $Z$ in (6) is the normalized factor also known as the partition function, which is the sum numbers $α$ of all the states of the system. $\begin{matrix} (5) & ℙ x, h = \frac{e^{- E x, h}}{Z}, \end{matrix}$ where $\begin{matrix} (6) & Z = \sum_{α} e^{- E_{α} / K T} . \end{matrix}$

3.2.3. The Training Process of GB-RBM

The training of GB-RBM is divided into two processes: (1) the coding process, also known as forward propagation; (2) the decoding process, also called back propagation or reconstruction process.

In the coding process, given the features in the visible layer, calculate the probability that a neuron in the hidden layer will be activated by sigmoid function, as shown in $\begin{matrix} (7) & ℙ h_{j} = 1 | x = sigmoid w_{j} \frac{x}{σ^{2}} + b_{j} . \end{matrix}$ Then, the randomizer generates a number from 0 to 1. If the number is less than the calculated $h_{j}$ , then the hidden layer node takes 1; otherwise, it takes 0.

In the decoding process, given the current state of all neurons in the hidden layer, we calculate the probability that a neuron in the visible layer will be activated, as shown in (8). Other than RBM, the mean $μ$ and variance $σ^{2}$ that conform to the Gaussian distribution should be added for GB-RBM. $\begin{matrix} (8) & ℙ x_{k} = 1 | h = N c_{k} w_{k} + x, σ^{2}, \end{matrix}$ where $N μ, σ^{2}$ denotes the Gaussian probability density function with mean $μ$ and standard deviation $σ$ .

Then, the randomizer generates a number from 0 to 1. If the number is less than the calculated $x_{k}$ , then the visible layer node is $x_{k}$ ; otherwise, it takes that random number.

After training by performing coding process and decoding process alternately, the reconstruction error of $x$ and $ℙ x_{k} = 1 | h$ is very small, which indicates that the GB-RBM tends to stabilize.

In the training process of GB-RBM, an efficient CD algorithm which is common in deep learning has been applied, and it is illustrated in Algorithm 1. In this way, GB-RBM can be trained in the same way as a normal RBM. In Section 4.1, the experimental results show the improvement of classification accuracy after using GB-RBM.

Algorithm 1: CD for training GB-RBM in the waypoint behavior dataset.

Input: Dataset x (n), n = 1,…, N;

Output: W, c, b

(1) Set learning rate： $α = 0.001$ , epoch: $T = 150$ ;

(2) Initial：W ⟵ 0, c ⟵ 0, b ⟵ 0;

(3) Calculate mean value and variance $σ$ of vectors in dataset;

(4) for t = 1 …T do

(5) for n = 1…N do

(6) choose an input vector $x$ ，calculate $p h_{j} = 1 | x$ by using Equation (4)，and randomly choose a hidden vector $h$ according the distribution;

(7) calculate positive gradient $x h / σ^{2}$

(8) according to $h$ , calculate $p x_{k} = 1 | h$ by using Equation (5), obtain $x^{'}$

(9) according to $x^{'}$ , calculate $p h_{j} = 1 | x^{'}$ by using Equation (4), obtain $h^{'}$ ;

(10) calculate reverse gradient $x^{'} h^{'} / σ^{2}$

(11) W ← W + $α x h / σ^{2} - x^{'} h^{'} / σ^{2}$

(12) c ← c + $α x / σ^{2} - x^{'} / σ^{2}$

(13) b ← b + $α h - h^{'}$

(14) end

(15) end

3.3. Model Prediction and Invocation

3.3.1. Development of Prediction Scripts

The two DNNs are optimized by GB-RBM, followed by fine-tuning. After training, the saved model structure and parameters are used to regenerate the prediction script. The essence of the prediction script is a function that calls the neural network model and gives the prediction results according to the input variables.

LevelPredictor.py is a python script used to evaluate the effects of waypoint behavior, whose main function is to call the saved .h5 classification model file to predict. LDPredictor.py is a python script used to predict the waypoint behavior parameters, and its main function is to estimate the LOS parameters.

3.3.2. Invocation of DNN Model by Using “pDeepLearning”

A new MOOS app is developed to perform the computing of the trajectory tracking control prediction model, so as to establish the interface between the trained deep neural network and the MOOS-IvP platform. “pDeepLearning” is the key module for all types of information interaction. It is a C++ program inherited from the CMOOSApp class in MOOS. On the one hand, pDeepLearning is used to publish and subscribe the data in MOOS. On the other hand, DNN implemented as python scripts is called by pDeepLearning.

We deployed a set of MOOS apps and MOOSDB, for the USV, that perform waypoint behavior. The DNN is then integrated into the MOOS-IvP by loading “pDeepLearning.” Figure 11 shows the system structure of USV with DNN model.

[figure omitted; refer to PDF]

During the voyage, “pDeepLearning” first receives the speed and planned steering angle of USV from MOOS-IvP. Then, “pDeepLearning” calls the LevelPredictor.py and LDPredictor.py scripts that are used in predicting the waypoint behavior effect and LOS parameters. Finally, the WPT_UPDATE variable described in Section 2.3 is used to publish the parameters lead distance and lead damper into MOOS-IvP. Section 4.3 reveals the performance of pDeepLearning.

4. Results and Discussion

The experimental results are carried out in two aspects: Firstly, we take the previous research as the benchmark. The classification accuracy is improved after the DNN model adopts the “pretraining and fine-tuning” method by GB-RBM. Secondly, an experimental platform is constructed based on MOOS-IvP, and the effect of running deep learning application pDeepLearning in MOOS-IvP for trajectory tracking control can be seen.

4.1. The Effect of Training GB-RBM

Figure 12 shows that the first layer of the classification model and regression model used GB-RBM matter to carry on the pretraining process, respectively, in which the horizontal axis shows the epoch times, and the vertical axis represents the reconstruction error between the visible layer and the hidden layer. Besides, we choose that mean square error which is commonly used in deep learning training. The experimental results show that the reconstruction error of the two GB-RBM are smaller through training; the reconstruction error of the classification model tends to be 0.09, as shown in Figure 12(a)), and that of the regression model tends to be 0.03, as shown in Figure 12(b), which suggests that the artificial neural network has the ability to restore the original data after transformation between the visible layer and the hidden layer. The GB-RBM proposed has learned the features of the waypoint behavior.

[figures omitted; refer to PDF]

4.2. The Effect of Classification Accuracy

The accuracy varies for different depths and widths of the DNN structure. We compare the accuracy between initialization parameters using GB-RBM pretraining and no pretraining phase. The results are shown in Tables 3 and 4, respectively.

Table 3

Training effect by using different depth structures.

Depth	Accuracy on validation set		Accuracy on test set
Depth	No pretraining	Pretraining	No pretraining	Pretraining
5	80.1	84.8	72.3	77.8
7	81.4	85.1	73.5	75.2
8	88.2	91.3	84.9	88.9
9	78.2	82.5	73.5	75.9
11	76.7	80.8	67.4	70.1
12	52.1	55.7	40.8	42.2

Table 4

Training effect by using different width structures.

Width	Accuracy on validation set		Accuracy on test set
Width	No pretraining	Pretraining	No pretraining (%)	Pretraining (%)
6-6-6-6-6-6-6-5	82.8	83.2	81.0	84.5
6-7-7-7-7-7-7-5	82.8	87.3	76.2	80.3
6-6-7-7-8-7-6-5	88.2	91.3	84.9	88.9
6-8-8-8-8-8-8-5	63.9	66.2	61.9	71.8
6-10-10-10-10-10-10-5	9.7	71.4	66.7	70.2
6-6-6-6-6-6-6-5	82.8	82.9	81.0	82.6

Experiments show that the structure of DNN, which is 6-6-7-7-8-7-6-5 nodes in each layer, has a maximum accuracy of 91.3% and 88.9% on the verification set and test set separately after pretraining.

Then, we employ the well-trained DNN to predict the effect of USV waypoint behavior at different speeds and steering angles. The results of the experiments are presented below.

Figures 13 and 14 show the prediction of the DNN for different steering angles and speeds, respectively, without any changes in other parameters. Figures 13(a) and 14(a) present reference values, Figure 13(b) show the predicted values without pretraining, and Figures 13(c) and 14(c) display the predicted values with pretraining.

[figures omitted; refer to PDF]

It can be seen that the same trends are present in the predictions and the ground truth, and the unsupervised learning process using GB-RBM can improve the predicted accuracy.

4.3. The Effect of Trajectory Tracking

4.3.1. Simulation Preparation

As mentioned above, the goal of this study is to predict the waypoint behavior of USV through DNN to optimize its trajectory tracking effect. A comparative navigation simulation experiment based on MOOS-IvP platform is conducted. Two USVs with the same type and length (7 m) are deployed. The first USV named alpha does not use DNN, and the second one named Alder uses dual-DNN prediction model. Other than that, the configuration parameters of the two USVs are identical. The two USVs start from the same initial point and track a planned route consisting of five waypoints.

The dual-DNN prediction model adopted by the USV Alder is implemented by running pDeepLearning application to predict the behavior of the waypoint in real time. The experimental results are recorded by pLogger application in MOOS and extracted and analyzed by alogview toolbox.

The MOOS-IvP simulation platform is shown in Figure 15, in which the small window shows the situation of USV Alder running pDeepLearning for prediction. The main configuration parameters of two USVs and their waypoint behavior configurations are given in Table 5.

[figure omitted; refer to PDF]

Table 6 shows the performance quantitative indicators. From the perspective of mean error and variance, the values of USV by using the prediction model are smaller. From the view of the integrated absolute error (IAE) and the time integrated absolute error (ITAE), the USV with prediction model has better transient and steady-state performance.

Table 6

Performance index in trajectory tracking.

Vehicle name	Alder (with DNN model)	Alpha (without DNN model)
The average of tracking error	1.75	2.16
The variance of tracking error	12.70	16.43
IAE (10³) ( $\int_{0}^{t} e t d τ$ )	0.685	0.825
ITAE (10⁴) ( $\int_{0}^{t} t e t d τ$ )	4.691	5.499

4.3.4. The Effect of Waypoint Behavior

Figure 19 shows a comparison of waypoint behavior effect after modifying parameters by the prediction model. Among them, the green line is the waypoint behavior effect after optimization of LOS parameters by the prediction model, and the blue line is the waypoint behavior effect without modification of the prediction model. It is clear that the waypoint behavior effect is better after optimization. The black line shows the changes of steering angle, which serves as a reference, indicating that once the steering angle changes, the model will predict a new waypoint behavior effect level value.

[figure omitted; refer to PDF]

The effect of waypoint behavior is shown in Table 7; the effect was improved by 1 level after LOS parameters were adjusted by the prediction model.

Table 7

Performance indices in waypoint behavior effect evaluation.

Vehicle name	Adjusted by DNN model	Not adjusted
The average of level	1.8	2.8
The variance of level	1.36	1.36

4.3.5. The Effect of Prediction for LOS Parameters

Figure 20 shows the lead distance and lead damper values which are related to the guidance law predicted by DNN-2. The green line and blue line signify the predicted value of lead distance and lead damper, respectively. The black line signifies the change of steering angle and speed, respectively. The black line and the red line are used as the reference, indicating that when the steering angle changes, the model will predict new lead distance and lead damper. In addition, during the voyage, predicted lead damper values have been affected by the change of speed.

[figure omitted; refer to PDF]

4.3.6. The Influence on Speed and Course

Figure 21 shows the changes of speed and course during the voyage. The scale of Figure 21(a) is the entire navigation process, and Figure 21(b) is the amplification of the second steering process. The green line is the speed and course of USV Alder by using the prediction model, the blue line is the speed and course of USV alpha without prediction model, and the black dashed line is the course of advance. It can be seen that there is little difference between the changes of the two vehicles’ speed and course, which indicates that the intervention of waypoint behavior by using DNN does not cause rapid increase or decrease for speed and course. The method does not have too much influence on the control loop. It is reliable.

[figures omitted; refer to PDF]

From the above results in Section 4, we can make a summary:

(1) The ability of DNN model for trajectory tracking control based on GB-RBM optimization to evaluate the waypoint behavior effects has been greatly improved compared with the previous model. The accuracy of the test set has been improved by 5%, reaching 88.9%.

(2) After correction, the average trajectory tracking error of USV is reduced by 19.0%, and the waypoint behavior effect level has been raised by one level.

(3) The DNN prediction model which is applied to the trajectory tracking control of USV can evaluate the waypoint behavior effect before steering and adjust the parameters of guidance law in real time.

5. Conclusions

In this study, two prediction models based on DNN have been constructed, i.e., “DNN-1: waypoint behavior effect evaluation model” and “DNN-2: real-time LOS parameter valuation model.” The models are connected in parallel with the LOS guidance process for trajectory tracking of USV, which improves the effect of trajectory tracking obviously. The experimental results have demonstrated the positive effect of deep learning method on autonomous navigation of USV. The DNN has learned the mapping relationship between different features and effect levels in the waypoint behavior through the training of dataset. Through the real-time prediction of LOS parameters, the trajectory tracking error is reduced by about 1 times length of the vehicle.

We have developed a new MOOS application. To our knowledge, this is the first time that DNN is integrated dynamically into MOOS-IvP, a well-known marine autonomous platform. Although it only predicts the guidance process of USV, the enhancement is significant. In the future, it can further improve the overall capability of USV’s self-driving.

In addition, a dual-DNN model has been in collaboration with the prediction of trajectory tracking control process, which is also our first attempt. The experimental results have proved its feasibility, and we believe that the way of evaluating the waypoint behavior effect firstly and then executing the maneuvering according to the prediction is more in line with the regular mode of ship maneuvering and control in marine domain. It will be beneficial to improve the reliability of real-world scenarios.

References

[1] S. Campbell, W. Naeem, G. W. Irwin, "A review on improving the autonomy of unmanned surface vehicles through intelligent collision avoidance manoeuvres," Annual Reviews in Control, vol. 36 no. 2, pp. 267-283, DOI: 10.1016/j.arcontrol.2012.09.008, 2012.

[2] L. E. Kavraki, S. M. LaValle, Motion Planning, 2016.

[3] P. Svec, A. Thakur, E. Raboin, B. C. Shah, S. K. Gupta, "Target following with motion prediction for unmanned surface vehicle operating in cluttered environments," Autonomous Robots, vol. 36, pp. 383-405, 2014.

[4] T. I. Fossen, Handbook of Marine Craft Hydrodynamics and Motion Control, 2011.

[5] Y. Fan, H. Huang, Y. Tan, "Robust adaptive path following control of an unmanned surface vessel subject to input saturation and uncertainties," Applied Sciences, vol. 9 no. 9,DOI: 10.3390/app9091815, 2019.

[6] M. R. Benjamin, H. Schmidt, P. M. Newman, J. J. Leonard, "Nested Autonomy for unmanned marine vehicles with MOOS-IvP," Journal of Field Robotics, vol. 27 no. 6, pp. 834-875, DOI: 10.1002/rob.20370, 2010.

[7] W. L. Sun, X. Gao, "Predicting the Trajectory tracking control of unmanned surface vehicle based on deep learning," Artificial Intelligence in China, pp. 591-598, DOI: 10.1007/978-981-15-0187-6_70, 2020.

[8] J. Velagic, Z. Vukic, E. Omerdic, "Adaptive fuzzy ship autopilot for track-keeping," Control Engineering Practice, vol. 11 no. 4, pp. 433-443, DOI: 10.1016/s0967-0661(02)00009-6, 2003.

[9] A. P. Aguiar, L. Cremean, J. P. Hespanha, "Position tracking for a nonlinear underactuated hovercraft: controller design and experimental results," Proceedings of the 42nd IEEE Conference on Decision and Control, vol. 4, pp. 3858-3863, .

[10] C. Lv, H. Yu, J. Chi, "A hybrid coordination controller for speed and heading control of underactuated unmanned surface vehicles system," Ocean Engineering, vol. 176, pp. 222-230, DOI: 10.1016/j.oceaneng.2019.02.007, 2019.

[11] Y. Xia, K. Xu, Y. Li, G. Xu, X. Xiang, "Improved line-of-sight trajectory tracking control of under-actuated AUV subjects to ocean currents and input saturation," Ocean Engineering, vol. 174, pp. 14-30, DOI: 10.1016/j.oceaneng.2019.01.025, 2019.

[12] H. Huang, M. Gong, Y. Zhuang, S. Sharma, D. Xu, "A new guidance law for trajectory tracking of an underactuated unmanned surface vehicle with parameter perturbations," Ocean Engineering, vol. 175, pp. 217-222, DOI: 10.1016/j.oceaneng.2019.02.042, 2019.

[13] C. Li, Y. S. Zhao, G. Wang, Y. Fan, Y. Bai, "Adaptive RBF neural network control for unmanned surface vessel course tracking," Proceedings of Sixth International Conference on Information Science and Technology, .

[14] R. Skulstad, G. Li, H. Zhang, T. I. Fossen, "A neural network approach to control allocation of ships for dynamic positioning," IFAC-PapersOnLine, vol. 51 no. 29, pp. 128-133, DOI: 10.1016/j.ifacol.2018.09.481, 2018.

[15] X. Cheng, G. Li, R. Skulstad, "Data-driven uncertainty and sensitivity analysis for ship motion modeling in offshore operations," Ocean Engineering, vol. 179, pp. 261-272, DOI: 10.1016/j.oceaneng.2019.03.014, 2019.

[16] Y. Shuai, G. Li, X. Cheng, "An efficient neural-network based approach to automatic ship docking," Ocean Engineering, vol. 191,DOI: 10.1016/j.oceaneng.2019.106514, 2019.

[17] Z. Qiang, Z. Guibing, H. Xin, Y. Renming, "Adaptive neural network auto-berthing control of marine ships," Ocean Engineering, vol. 177, pp. 40-48, DOI: 10.1016/j.oceaneng.2019.02.031, 2019.

[18] Y. Wang, J. Tong, T. Y. Song, Z. H. Wan, "Unmanned surface vehicle course tracking control based on neural network and deep deterministic policy gradient algorithm," Proceedings of Oceans- MTS/IEEE Kobe Techno-Oceans (OTO), .

[19] B. Marius, D. D. Testa, D. Dworakowski, "End-to-end deep learning for self-driving cars," 2016. http://devblogs.nvidia.com/deep-learning-self-driving-cars

[20] J. S. Kim, D. J. Kim, S. H. Lee, C. C. Chung, "Autonomous driving vehicles with unmatched disturbance compensation using deep neural networks," Proceedings of 2019 19th International Conference on Control, Automation and Systems, .

[21] Q. Xu, Y. Yang, C. Zhang, L. Zhang, "Deep convolutional neural network-based autonomous marine vehicle maneuver," International Journal of Fuzzy Systems, vol. 20 no. 2, pp. 687-699, DOI: 10.1007/s40815-017-0393-z, 2018.

[22] Y. Z. Chen, Y. Y. Shi, B. S. Zhang, "Optimal control via neural networks: a convex approach," to Appear in International Conference on Learning Representations, .

[23] X. Tan, S. Su, Z. Zuo, X. Guo, X. Sun, "Intrusion detection of UAVs based on the deep belief network optimized by PSO," Sensors, vol. 19 no. 24,DOI: 10.3390/s19245529, 2019.

[24] R. Eldan, O. Shamir, "The power of depth for feedforward neural networks," JMLR: Workshop and Conference Proceedings, vol. 49, 2016.

[25] G. E. Hinton, R. R. Salakhutdinov, "Reducing the dimensionality of data with neural networks," Science, vol. 313 no. 5786, pp. 504-507, DOI: 10.1126/science.1127647, 2006.

[26] S. Goudarzi, M. N. Kama, M. H. Anisi, S. A. Soleymani, F. Doctor, "Self-organizing traffic flow prediction with an optimized deep belief network for internet of vehicles," Sensors, vol. 18,DOI: 10.3390/s18103459, 2018.

[27] W. Zhao, L. Xu, J. Bai, M. Ji, T. Runge, "Sensor-based risk perception ability network design for drivers in snow and ice environmental freeway: a deep learning and rough sets approach," Soft Computing, vol. 22 no. 5, pp. 1457-1466, DOI: 10.1007/s00500-017-2850-x, 2018.

[28] T. Yamashita, M. Tanaka, E. Yoshida, Y. Yamauchi, H. Fujiyoshi, "To be Bernoulli or to be Gaussian, for a Restricted Boltzmann Machine," Proceedings of 2014 22nd International Conference on Pattern Recognition, .

[29] M. R. Benjamin, P. Newman, H. Schmidt, J. J. Leonard, "An overview of moos-ivp and a brief users guide to the ivp helm autonomy software," vol. 28, 2009. Technical Report

[30] L. Y. Dong, H. L. Xu, "Design of heading control system for USV based on MOOS-IvP," Proceedings of the IEEE 2nd Information Technology, Networking, Electronic and Automation Control Conference (ITNEC), .

[31] N. Dula, D. Vladimir, "Command Filtered backstepping design in MOOS-IvP helm framework for trajectory tracking of USVs," .

[32] M. R. Benjamin, M. Defilippo, P. Robinette, M. Novitzky, "Obstacle avoidance using multiobjective optimization and a dynamic obstacle manager," IEEE Journal of Oceanic Engineering, vol. 44 no. 2, pp. 331-342, DOI: 10.1109/joe.2019.2896504, 2019.

[33] F. A. Papoulias, "Bifurcation analysis of line of sight vehicle guidance using sliding modes," International Journal of Bifurcation and Chaos, vol. 01 no. 4, pp. 849-865, DOI: 10.1142/s0218127491000622, 1991.

Word count: 6712

Show less

Copyright © 2021 Wenli Sun and Xu Gao. This is an open access article distributed under the Creative Commons Attribution License (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. https://creativecommons.org/licenses/by/4.0/

Abstract

Translate

Trajectory tracking control based on waypoint behavior is a promising way for unmanned surface vehicle (USV) to achieve autonomous navigation. This study is aimed at the guidance progress in the kinematics; the artificial intelligence method of deep learning is adopted to improve the trajectory tracking level of USV. First, two deep neural network (DNN) models are constructed to evaluate navigation effects and to estimate guidance law parameters in real time, respectively. We then pretrain the DNN using a Gaussian–Bernoulli restricted Boltzmann machine to further improve the accuracy of predicting navigation effect. Finally, two DNNs are connected in parallel with the control loop of USV to provide predictive supervision and auxiliary decision making for traditional control methods. This kind of parallel way conforms to the ship manipulation of habit. Furthermore, we develop a new application on the basis of Mission Oriented Operating Suite Interval Programming named “pDeepLearning.” It can predict the navigation effect online by DNN and adjust the guidance law parameters according to the effect level. The experimental results show that, compared with the original waypoint behavior of USV, the prediction model proposed in this study reduces the trajectory tracking error by 19.0% and increases the waypoint behavior effect level.

Details

Title

Deep Learning-Based Trajectory Tracking Control forUnmanned Surface Vehicle

Author

Sun, Wenli¹; Gao, Xu²

¹ Navigation College, Dalian Maritime University, Dalian 116026, China
² National Engineering Research Center of Maritime Navigation System, Dalian Maritime University, Dalian 116026, China

Editor

Giuseppe D'Aniello

Publication year

2021

Publication date

2021

Publisher

John Wiley & Sons, Inc.

ISSN

1024123X

e-ISSN

15635147

Source type

Scholarly Journal

Language of publication

English

DOI

https://doi.org/10.1155/2021/8926738

ProQuest document ID

2478359219

Deep Learning-Based Trajectory Tracking Control forUnmanned Surface Vehicle

Jump to:

Full Text

Abstract

Details

Suggested sources