A Novel Air Pollutant Concentration Prediction

Full text

Turn on search term navigation

1. Introduction

With the continuous promotion of global industrialization and urbanization, the environmental problems accumulated by rapid development, especially the problem of air pollution, have become increasingly prominent. The harm caused by air pollution not only seriously affects human health [1,2] but also causes substantial economic losses to all countries [3]. The specific numerical results show that the rise of PM_2.5 concentration by 10 g/m³ can increase mortality by 2% [4] and in 2016, bring a total loss of 101.39 billion dollars to 338 cities in China [5]. Therefore, it is imperative to establish an accurate, reliable, and effective air pollutant concentration prediction (APCP) system to announce the expected pollutant concentration to the public in the successive few periods, which is also a research hotspot and challenging issue in the field of systems engineering.

The state-of-the-art work for the prediction of air pollutants (APs) mainly includes deterministic models [6] and data-driven models [7]. The deterministic model relies on the emission source data and various historical meteorological data to simulate the formation of pollutants through the physical and chemical processes of the generation, conversion, and diffusion of APs [8]. For example, CMaq [9,10,11], Wrf-Chem [12,13,14], and LOtos-EUros [15] are all popular deterministic models. W. Wei et al. [16] adopted the Wrf-Chem model to effectively simulate the concentration of APs in Beijing. Based on this model, the impact of pollution emissions on air quality is quantified. However, the complexity of climate, land use, and emission sources is difficult to capture, which limits the development of such models to a certain extent [17].

Relying on the continuous in-depth study of algorithm models and the continuous improvement of computer hardware performance, data-driven models [18] have become the preferred method of prediction, including linear models [19] and nonlinear models [20]. Classical linear models, such as autoregressive integrated moving average (ARima) models [21] and seasonal autoregressive integrated moving average (SARima) [22], are based on the assumption that the studied sequence is linear. In addition, quantile regression (Qr) [23] and other models are also widely used in the field of AP concentration prediction. T. Liu et al. [24] integrated ARima model with three deterministic models (ARimaX) to predict APs, respectively. Taking the results of CMaq-ARima model as an example, the mean of daily root mean squared error (RMSE) at the three stations decreased by $\bar{R M S E^{N O_{2}}} = (46.3 % + 43.4 % + 41.2 %) / 3 = 43.6 %$ . S. Abdullah et al. [25] fitted the multiple linear regression (MLR) model to study the cross-border impact of APs on Malaysia. Based on the MLR model, the values of the next one, two, and three time points in the input sequence of PM₁₀ concentration were predicted, that is, the 1–3 step prediction was performed in the experiment. It is verified that the result of 1-step prediction is the best, in which $R^{2} = 0.638$ and $R M S E = 126.728 μ g / m^{3}$ . Due to the prevalence of multicollinearity, N. Mohd Napi et al. [26] applied principal component analysis (PCA) to reduce the high correlation between the data and then used MLR to predict the ozone concentration. It was shown that the value of R² was improved by $(R_{P C A - M L R}^{2} -$ $R_{M L R}^{2}) / R_{M L R}^{2} =$ $(0.405 - 0.325) / 0.325 \times$ $100 % = 24.615 %$ compared with MLR model.

Although the application of the linear model is relatively simple, in practice, the characteristics of the AP series appear nonlinear and time-varying complexity. Thus, nonlinear models are widely utilized. Y. Bai et al. [27] combined wavelet transform technology with the back propagation (BP) model, named the W-BPnn model, to improve the nonlinear mapping performance of the prediction model. Compared with the BP model, the value of RMSE was reduced by $(R M S E_{B P}^{S O_{2}} - R M S E_{W - B P n n}^{S O_{2}}) / R M S E_{B P}^{S O_{2}} =$ $(12.716 - 8.269) / 12.716 \times$ $100 % = 34.972 %$ . Because the sequence of APs concentration has long-term correlation characteristics, scientifically determining the lag order can avoid the gradient disappearance of artificial neural networks (ANNs). Considering the relationship between space and time, X. Li et al. [28] extended the Lstm model (LstmE) to predict multi-scale APs, which successfully realized the scientific identification of lag order. The comparison results showed that with the extension of prediction time, the required lag order gradually increased. Although the error also increased, it was still within the acceptable interval of long-term prediction results. In order to reduce the noise of input data, X. Jiang et al. [29] split the original sequence with the complete ensemble empirical mode decomposition with adaptive noise (CEemdan) strategy and entropy method, and then input the subsequences into the bidirectional Lstm (Bilstm) model for prediction respectively. This method greatly improved the efficiency of the prediction model.

The individual ANN model is lack flexibility for the adjustment of parameters, which limits the improvement of prediction accuracy [30]. The optimization algorithm provides a new solution for improving the prediction performance [31,32]. J. Murillo-Escobar et al. [33] designed the Svr-Pso model to estimate the concentration of APs in five regions. Compared with the feedforward artificial neural network (FAnn) model used in the San Buenaventura University in Bello (BEL-USBV) ( $R M S E_{T M V - T P C - F A n n}^{B E L - U S B V - O_{3}} = 11.77 ppm$ ), Pso optimized the prediction performance of the hybrid model ( $R M S E_{T M V - T P C - S v r - P s o}^{B E L - U S B V - O_{3}} = 9.43 ppm$ ). Q. Wu et al. [34] constructed the Ba-LSsvm model to predict the decomposed low-frequency data, while the high-frequency data were simulated after secondary decomposition. Ba-LSsvm model not only produced better prediction results than Pso optimization, for example, the value of mean absolute percentage error (MAPE) is reduced by $M A P E_{P s o}^{B e i j i n g} - M A P E_{B a}^{B e i j i n g} =$ $0.1273 - 0.0877 =$ $0.0396$ but also included the function of external variables. Similarly, G. Li et al. [35] proposed the CEemdan-Dse-BVmd-Csa-KElm model to disintegrate the data with high complexity to simplify the input sequence. Csa mechanism searched the optimal adjustable parameters of the KElm model, which was conducive to configuring the model on demand and realizing effective prediction in multiple fields. However, the optimization mechanism only focuses on the improvement of the precision of model prediction, and the importance of model robustness is ignored. To balance various performances, a series of multi-objective optimization mechanisms [36,37] represented by the multi-objective dragonfly algorithm (Moda) [38], multi-objective chameleon swarm algorithm (MOcsa) [39], and multi-objective salp swarm algorithm (MSsa) [40] have achieved good results in energy prediction. It can be considered to develop a multi-objective optimization mechanism to weigh the prediction performance of APs. In addition, some of the latest research models are evaluated in Table 1.

According to the analysis of existing literature, the shortcomings of different prediction models can be summarized as follows: (i) Deterministic models are highly professional and expensive, which limits the extendability of the model. (ii) The data-driven linear model is limited by linear assumptions, while the actual data is a mixture of linear and nonlinear characteristics. Only a single feature is extracted, which causes the loss of input information. (iii) Although the optimization mechanism avoids the randomness of parameter setting, in most cases, the accuracy of the model is only set as the unique objective function, which lacks the measurement of the stability of the prediction model. Therefore, to solve these drawbacks, a novel air pollutant concentration prediction (APCP) system is proposed for environmental system management in this paper. It is mainly established by four modules, namely time series reconstruction, submodel simulation, weight search, and integration. Specifically, in the time series reconstruction module, the decomposition-ensemble (DE) mode is constructed to filter the redundancy of the original sequence and then merge similar features. The obtained reconstruction sequence is trained by four prediction submodels (convolutional neural network (Cnn), long short-term memory (Lstm), extreme learning machine (Elm), and Elman neural network (Enn)) as input in the submodel simulation module. The outputs of four independent prediction sequences are assigned appropriate weights applying the weight search (Ws) mechanism. Based on this optimal weight, the final prediction results of the APCP system are integrated.

The main contributions of this paper are introduced as follows:

(1) DE mode is designed in the APCP systemto filter out the interference of high-frequency noise signals. The sequence of APs contains more information and has strong volatility. The DE mode is applied to reconstruct the original sequence, which not only eliminates the interference of high-frequency noise signals but also integrates the key information.

(2) A novel weight search mechanism, Ws, is developed in the weight search module. This mechanism makes up for the limitations of the previous optimization mechanism, such as the single objective function, ease of falling into the dilemma of local optimization, and long running time. By the comparative experiment, it can be proved that the weight search mechanism used in this paper is superior to previous optimization mechanisms both in searchability and stability.

(3) The construction of the four prediction submodels in the APCP system covers the inherent modes of sequence as much as possible, including three ANN models (Cnn, Elm, and Enn) and a deep learning model called Lstm. The four models make use of each other’s advantages to fill in the deficiencies and comprehensively capture sequence modes. Based on the extracted sequence modes, the prediction performance of the APCP system is improved.

(4) The proposed APCP system achieves the trade-off between double objectives, prediction precision, and robustness, in the paper. It comprises four modules: time series reconstruction, submodel simulation, weight search, and integration. Based on hourly PM_2.5 concentration data in Guangzhou, Shanghai, and Chengdu, China, the superiority of the APCP system is testified by three experiments and discussions. It can also provide scientific data support for air pollution control and policy-making.

The remainder of the paper is arranged as follows. The next section introduces the design of the APCP system, including the construction principles of decomposition-ensemble mode and weight search mechanism, and the structure of the APCP system. Section 3 describes three experiments to verify the superiority of the proposed APCP system. Three discussions, which are significance analysis, correlation analysis, and sensitivity analysis, are carried out in Section 4. Section 5 summarizes the conclusions and prospects for the further development of the APCP system.

2. Design of the APCP System

The APCP system is established by four modules, which are time series reconstruction, submodel simulation, weight search, and integration. As depicted visually in Figure 1, firstly, the original sequence is split into subsequences of different frequencies by DE mode, and noise is filtered out. After recombining the subsequences according to the similarity, the multi-step prediction is realized in four individual prediction submodels, respectively. Next, under the constraint of the multi-objective function, the weight search mechanism is constructed to determine the appropriate weights of the four submodels. Finally, the multi-step prediction results are output based on the obtained weights.

In this section, before the structure of the APCP system is elaborated in detail, the DE mode and weight search mechanism applied to the system are designed as follows.

2.1. Decomposition-Ensemble Mode

The concentration series of AP has a complex structure and frequent fluctuations, which will cause a computational burden if directly input into the prediction model. Therefore, the paper designs the DE mode to divide the sequence structure and clarify the modal characteristics.

First, ${\tilde{Λ}}_{v}^{1} \sim N (0, 1), v = 1, 2, \dots, V$ is added to the original concentration sequence $C^{A P}$ to form a new fluctuation sequence $C_{v}^{1} = C^{A P} + ι^{1} {\tilde{Λ}}_{v}^{1}$ , where $N (0, 1)$ means standard normal distribution and $ι^{1}$ is a random number. Then, the upper ${\overset{⌢}{C}}_{v}^{1} = Δ^{3} (\overset{⌢}{E} (C_{v}^{1}))$ and lower ${\underset{⌣}{C}}_{v}^{1} = Δ^{3} (\underset{⌣}{E} (C_{v}^{1}))$ boundaries of $C_{v}^{1}$ are calculated, where $Δ^{3} (\cdot)$ is the cubic spline difference function, and $\overset{⌢}{E} (C_{v}^{1})$ and $\underset{⌣}{E} (C_{v}^{1})$ represent local maximum and minimum value sequences of $C_{v}^{1}$ , respectively. The first component ${\tilde{C}}_{v}^{1} = C_{v}^{1} - ({\overset{⌢}{C}}_{v}^{1} + {\underset{⌣}{C}}_{v}^{1}) / 2$ is obtained. If ${\tilde{C}}_{v}^{1}$ satisfies Equation (1), then the first mode component ${\tilde{T}}_{v}^{1}$ can be expressed as ${\tilde{T}}_{v}^{1} = {\tilde{C}}_{v}^{1}$ , otherwise, let $C_{v}^{1} = {\tilde{C}}_{v}^{1}$ and repeat the above process.

(1) $\{\begin{array}{l} |N [\overset{⌢}{E} ({\tilde{C}}_{v}^{1}) + \overset{⌢}{E} ({\tilde{C}}_{v}^{1})] - N [Z ({\tilde{C}}_{v}^{1})]| \leq 1 \\ \begin{matrix} [{\overset{⌢}{\tilde{C}}}_{v}^{1, c} + {\underset{⌣}{\tilde{C}}}_{v}^{1, c}] / 2 = 0, & \forall c = 1, 2, \dots, C \end{matrix} \end{array}$

where

N [\cdot]

stands for counting function,

Z ({\tilde{C}}_{v}^{1})

represents the set of points with zero value in the sequence of

{\tilde{C}}_{v}^{1}

{\overset{⌢}{\tilde{C}}}_{v}^{1, c}

, and

{C_{⌣}^{˜}}_{v}^{1, c}

indicate the upper and lower boundaries of

{\tilde{C}}_{v}^{1}

at time point c, and C is the total number of time points in the

{\tilde{C}}_{v}^{1}

Based on this, the first intrinsic mode function (IMF) $\bar{{\tilde{T}}^{1}}$ and residual sequence $ξ^{1}$ are listed in Equations (2) and (3).

(2) $\begin{array}{l} \bar{{\tilde{T}}^{1}} & = \sum_{v = 1}^{V} {\tilde{T}}_{v}^{1} / V = \sum_{v = 1}^{V} {\tilde{C}}_{v}^{1} / V = \sum_{v = 1}^{V} [C_{v}^{1} - ({\overset{⌢}{C}}_{v}^{1} + {\underset{⌣}{C}}_{v}^{1}) / 2] / V \\ = \sum_{v = 1}^{V} \{C_{v}^{1} - [Δ^{3} (\overset{⌢}{E} (C_{v}^{1})) + Δ^{3} (\underset{⌣}{E} (C_{v}^{1}))] / 2\} / V \\ = \sum_{v = 1}^{V} \{C^{A P} + ι^{1} {\tilde{Λ}}_{v}^{1} - [Δ^{3} (\overset{⌢}{E} (C^{A P} + ι^{1} {\tilde{Λ}}_{v}^{1})) + Δ^{3} (\underset{⌣}{E} (C^{A P} + ι^{1} {\tilde{Λ}}_{v}^{1}))] / 2\} / V \\ = C^{A P} + \sum_{v = 1}^{V} \{ι^{1} {\tilde{Λ}}_{v}^{1} - [Δ^{3} (\overset{⌢}{E} (C^{A P} + ι^{1} {\tilde{Λ}}_{v}^{1})) + Δ^{3} (\underset{⌣}{E} (C^{A P} + ι^{1} {\tilde{Λ}}_{v}^{1}))] / 2\} / V \end{array}$

(3) $\begin{array}{l} ξ^{1} = C^{A P} - \bar{{\tilde{T}}^{1}} \\ = - \sum_{v = 1}^{V} \{ι^{1} {\tilde{Λ}}_{v}^{1} - [Δ^{3} (\overset{⌢}{E} (C^{A P} + ι^{1} {\tilde{Λ}}_{v}^{1})) + Δ^{3} (\underset{⌣}{E} (C^{A P} + ι^{1} {\tilde{Λ}}_{v}^{1}))] / 2\} / V \\ = \sum_{v = 1}^{V} \{[Δ^{3} (\overset{⌢}{E} (C^{A P} + ι^{1} {\tilde{Λ}}_{v}^{1})) + Δ^{3} (\underset{⌣}{E} (C^{A P} + ι^{1} {\tilde{Λ}}_{v}^{1}))] / 2 - ι^{1} {\tilde{Λ}}_{v}^{1}\} / V \end{array}$

Similarly, let $C_{v}^{2} = ξ^{1} + ι^{2} {\tilde{Λ}}_{v}^{2}, {\tilde{Λ}}_{v}^{2} \sim N (0, 1)$ and all the above steps are executed again. The results of $\bar{{\tilde{T}}^{2}}$ and $ξ^{2}$ are expressed in Equations (4) and (5).

(4) $\bar{{\tilde{T}}^{2}} = ξ^{1} + \sum_{v = 1}^{V} \{ι^{2} {\tilde{Λ}}_{v}^{2} - [Δ^{3} (\overset{⌢}{E} (ξ^{1} + ι^{2} {\tilde{Λ}}_{v}^{2})) + Δ^{3} (\underset{⌣}{E} (ξ^{1} + ι^{2} {\tilde{Λ}}_{v}^{2}))] / 2\} / V$

(5) $ξ^{2} = \sum_{v = 1}^{V} \{[Δ^{3} (\overset{⌢}{E} (ξ^{1} + ι^{2} {\tilde{Λ}}_{v}^{2})) + Δ^{3} (\underset{⌣}{E} (ξ^{1} + ι^{2} {\tilde{Λ}}_{v}^{2}))] / 2 - ι^{2} {\tilde{Λ}}_{v}^{2}\} / V$

And so on, until $ξ$ is a monotonic function and DE mode is completed. All output results ( $\bar{{\tilde{T}}^{l}}, l = 1, 2, \dots, L$ and $ξ$ ) meet Equation (6).

(6) $C^{A P} = \sum_{l = 1}^{L} \bar{{\tilde{T}}^{l}} + ξ$

2.2. Weight Search Mechanism

To improve the comprehensive ability of the APCP system, the weight allocation of four individual submodels is the key to the successful application under the constraints of multi-objective functions. As an effective weight search mechanism, the multi-objective grasshopper optimization algorithm (Mogoa) is developed in the APCP system, and its structure is constructed as follows. Initially, the grasshopper population $Γ$ with $α$ individuals and $β$ dimensions is listed in Equation (7).

(7) $Γ = [\begin{matrix} τ_{1}^{1} & τ_{2}^{1} & \dots & τ_{β}^{1} \\ τ_{1}^{2} & τ_{2}^{2} & \dots & τ_{β}^{2} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ τ_{1}^{α} & τ_{2}^{α} & \dots & τ_{β}^{α} \end{matrix}]$

According to [52], the location of $Γ^{g}, g = 1, 2, \dots, α$ is not only affected by the other individuals $Γ^{g^{'}}, g^{'} \neq g$ , but also related to gravity $Θ^{g}$ and wind direction $Ξ^{g}$ , as shown in Equation (8).

(8) $Γ^{g} = λ_{1} \sum_{\begin{matrix} g^{'} = 1 \\ g^{'} \neq g \end{matrix}}^{α} \{[ε_{1} \cdot e x p (- \frac{|Γ^{g^{'} g}|}{ε_{2}}) - e x p (- |Γ^{g^{'} g}|)] \cdot \frac{Γ^{g^{'} g}}{|Γ^{g^{'} g}|}\} - λ_{2} Θ^{g} + λ_{3} Ξ^{g}$

where

Γ^{g^{'} g} = Γ^{g^{'}} - Γ^{g}

ε_{1}

and

ε_{2}

are two attractive parameters, and

λ_{k} = r a n d (1)

k = 1, 2, 3

Due to insufficient convergence, Equation (8) cannot realize the search for optimal weight. An improved convergence function is developed in the APCP system, as described in Equation (9). It is assumed that the change of $Γ_{h}^{g}, g = 1, 2, \dots, α; h = 1, 2, \dots β$ is not affected by $Θ^{g}$ , and $Ξ^{g}$ remains towards the food source $Ω_{h}^{F}$ .

(9) $Γ_{h}^{g} = δ \sum_{\begin{matrix} g^{'} = 1 \\ g^{'} \neq g \end{matrix}}^{α} \{\frac{δ ({\overset{⌢}{Γ}}_{h} - {\underset{⌣}{Γ}}_{h}) Γ^{g^{'} g}}{2 |Γ^{g^{'} g}|} \cdot [ε_{1} \cdot e x p (- \frac{|Γ_{h}^{g^{'} g}|}{ε_{2}}) - e x p (- |Γ_{h}^{g^{'} g}|)]\} + Ω_{h}^{F}$

where

{\overset{⌢}{Γ}}_{h}

and

{\underset{⌣}{Γ}}_{h}

are inequality constraint range of

Γ_{h}^{g}

Γ_{h}^{g^{'} g} = Γ_{h}^{g^{'}} - Γ_{h}^{g}

h = 1, 2, \dots, β

, and

δ

satisfy Equation (10).

(10) $δ = \frac{(I T e r - η) \overset{⌢}{δ} + η \underset{⌣}{δ}}{I T e r}$

where

\overset{⌢}{δ} = m a x (δ)

\underset{⌣}{δ} = m i n (δ)

η

indicates the current times, and ITer expresses the total number of cycles.

If the location of $Γ_{h}^{g}$ moves beyond the boundaries ( ${\underset{⌣}{Γ}}_{h}$ and ${\overset{⌢}{Γ}}_{h}$ ), $Γ_{h}^{g}$ is reset by Equation (11).

(11) $Γ_{h}^{g} = \{\begin{array}{l} {\underset{⌣}{Γ}}_{h} & i f & Γ_{h}^{g} ţ {\underset{⌣}{Γ}}_{h} \\ {\overset{⌢}{Γ}}_{h} & i f & Γ_{h}^{g} ţ {\overset{⌢}{Γ}}_{h} \\ Γ_{h}^{g} & i f & {\underset{⌣}{Γ}}_{h} ţ Γ_{h}^{g} ţ {\overset{⌢}{Γ}}_{h} \end{array}$

When $η = I T e r$ , the output results are the suitable weights $Π$ of the APCP system. The pseudo-code of the weight search mechanism is listed in Algorithm 1.

Algorithm 1. Weight search mechanism

Input:

\vec{Ψ} = ({\vec{ϕ}}^{1}, {\vec{ϕ}}^{2}, \dots, {\vec{ϕ}}^{N (T R a i n)})

Outputs:F—the best fitness results

𝛑

—the suitable weightsParameters:ITer—the iteration number

η

—the current iteration number

Γ^{g}

—the location of

g

-th grasshopper

α

—the number of

Γ

β

—the dimension of

Γ

ARchIve_max—the archive sizeARchIve_num—the number of repositories1: /* Initialize

Γ

. */2: /*Set

\overset{⌢}{δ}

\underset{⌣}{δ}

, and ITer.*/3. /*Calculate F of each search agent.*/4. /*

𝛑 = B e s t ⟨S e a r c h_A g e n t⟩

.*/5. WHILE (

η < I T e r

) DO6. /*Update coefficient

δ

.*/7.

δ = \frac{(I T e r - η) \overset{⌢}{δ} + η \underset{⌣}{δ}}{I T e r}

8. FOR (each search agent) DO9. /*Normalize

|Γ^{g^{'}} - Γ^{g}| \in [1, 4]

.*/10. /*Update

Γ_{h}^{g}

.*/11.

Γ_{h}^{g} = δ \sum_{\begin{matrix} g^{'} = 1 \\ g^{'} \neq g \end{matrix}}^{α} \{\frac{δ ({\overset{⌢}{Γ}}_{h} - {\underset{⌣}{Γ}}_{h}) Γ^{g^{'} g}}{2 |Γ^{g^{'} g}|} \cdot [ε_{1} \cdot e x p (- \frac{|Γ_{h}^{g^{'} g}|}{ε_{2}}) - e x p (- |Γ_{h}^{g^{'} g}|)]\} + Ω_{h}^{F}

12. /*Reset

Γ_{h}^{g}

if it moves beyond the boundaries.*/13.

Γ_{h}^{g} = \{\begin{array}{l} {\underset{⌣}{Γ}}_{h} & i f & Γ_{h}^{g} < {\underset{⌣}{Γ}}_{h} \\ {\overset{⌢}{Γ}}_{h} & i f & Γ_{h}^{g} > {\overset{⌢}{Γ}}_{h} \\ Γ_{h}^{g} & i f & {\underset{⌣}{Γ}}_{h} \leq Γ_{h}^{g} \leq {\overset{⌢}{Γ}}_{h} \end{array}

14. /*Update

𝛑

if a better solution is produced.*/15. /*Calculate F of each search agent.*/16. /*Identify the non-dominated solutions.*/17. /*Extended repository based on the non-dominated solutions.*/18. IF

(A R c h I v e_{n u m} > A R c h I v e_{m a x})

DO19. /* Start the repository maintainer to remove one repository resident.*/20. /*Put the new non-dominated solution into it.*/21. END IF22. END FOR23.

η = η + 1

24. END WHILE25. RETURN

𝛑

2.3. Framework of the APCP System

The APCP system is designed into four modules, as described in Figure 1, which are independent and coordinated. In this subsection, the structural design of each module is carefully explained to prove the scientific nature and rationality of the proposed system.

2.3.1. Time Series Reconstruction

Since only the influence of the historical AP series is considered, extracting as much information as possible from the original series is the premise to enhance the APCP system performance. The frequent fluctuation and disordered noise bring a burden to the prediction model. Therefore, the DE mode is designed to split the concentration series of AP to clarify the structure by increasing quantity. Based on the principle of similarity, the AP series is reconstructed. It not only reduces redundant interference but also ensures the integrity of series information.

2.3.2. Submodel Simulation

Using the PM_2.5 data of Guangzhou, Shanghai, and Chengdu in January 2015, the pre-experiment, including Cnn, Bilstm, least square support vector machine (Lssvm), gaussian process (Gp), Lstm, gate recurrent unit (Gru), Elm, and Enn models, is carried out to select the appropriate prediction submodels. From the results of the four evaluation indicators in Table 2, the prediction ability of the Cnn, Lstm, Elm, and Enn models is similar $\vec{M A P E} = [M A P E_{C n n}^{A v e .}, M A P E_{L s t m}^{A v e .}, M A P E_{E l m}^{A v e .}, M A P E_{E n n}^{A v e .}] = [7.751 %, 7.630 %,$ $7.654 %, 7.815 %]$ and superior to other models. Thus, Cnn, Lstm, Elm, and Enn as effective prediction tools are applied in the submodel simulation module.

2.3.3. Weight Search

The prediction ability of the individual submodel is limited, and it fails to fit all the characteristics of the AP series. The weight search mechanism can provide a new solution for identifying information from multiple perspectives by integrating all submodels. The APCP system aims to balance the prediction precision and robustness, so the objective functions of the weight search mechanism are shown in Equation (12). After the iteration, the optimal weights $Π$ of the four submodels are output.

(12) $\min \{\begin{array}{l} o b j_{I} = m e a n (|\frac{O_{σ} (596 t h - 744 t h) - F i t_{σ} (596 t h - 744 t h)}{O_{σ} (596 t h - 744 t h)}|) \\ o b j_{I I} = s t d [O_{σ} (596 t h - 744 t h) - F i t_{σ} (596 t h - 744 t h)] \end{array}$

where

O_{σ} (\cdot)

means the original series,

F i t_{σ} (\cdot)

is the fitting results, and

596 t h - 744 t h

represents the AP series used for testing.

2.3.4. Integration

According to Equation (13), the fitting values $ϑ$ of the four submodels are integrated with the corresponding optimal weights $𝛑$ to obtain the final prediction results ${\hat{Φ}}_{A P C P}$ .

(13) ${\hat{Φ}}_{A P C P} = \hat{φ} (ϑ, 𝛑) = \sum_{s = 1}^{4} π^{s} ϑ^{s}$

where

\sum_{s = 1}^{4} π^{s} = 1

Furthermore, to ensure the best performance of the APCP system, the parameters of all models included in the system are default as listed in Table 3, which is set by trial-and-error results.

3. Establishment and Evaluation of Experiment

Taking the hourly PM_2.5 concentration data of Guangzhou, Shanghai, and Chengdu as an example, this section makes a comparative analysis of three experiments on the APCP system, which aims to explain the performance of the proposed system from multiple perspectives.

3.1. Description of Datasets

This paper adopts hourly PM_2.5 concentration data from three cities in China, namely Guangzhou (Gz), Shanghai (Sh), and Chengdu (Cd), in January 2015, which are from http://archive.ics.uci.edu/ (accessed on 8 July 2022). The division of datasets and basic information such as maximum (Max.), minimum (Min.), mean (Mean), standard deviation (Std.), and maximum Lyapunov exponent (Mlye) are shown in Table 4.

According to the results of statistical evaluation, this month, the average PM_2.5 concentration in Chengdu is the highest $[M e a n_{T o t a l}^{G z}, M e a n_{T o t a l}^{S h}, M e a n_{T o t a l}^{C d}] = [34.155, 55.563,$ $64.711]$ and has a large fluctuation range $[S t d ._{T o t a l}^{G z}, S t d ._{T o t a l}^{S h}, S t d ._{T o t a l}^{C d}] = [69.609, 84.036,$ $141.839]$ . Besides, the Mlye values of the three cities are greater than zero, suggesting that the experimental data are chaotic. It proves the necessity of the time series reconstruction module in the APCP system.

3.2. Evaluation Indexes

For convenience to compare the experimental results, four evaluation criteria are utilized, including MAPE, MRE, MSE, and R². The expressions are given in Table 5.

3.3. Experiment I: Comparison with the Individual Models

Experiment I analyzes the multi-step prediction results of the APCP system compared with well-known models, which are Bilstm, Gru, Gp, and Lssvm models, and individual submodels, including Cnn, Lstm, Elm, and Enn models, as revealed in Table 6 and Figure 2. In order to ensure fairness, the data of the first four hours are used as input to predict the value of the fifth time point in all models. Moreover, the parameter settings of all models are shown in Table 3.

(a) For the dataset in Gz, the APCP system produces the best results in all measurement indicators. For the 1-step prediction, the MSE value of the APCP system is only $M S E_{A P C P}^{G z - 1 - s t e p} = 20.861$ , while the minimum value of the other individual models is $M S E_{L s t m}^{G z - 1 - s t e p} = 22.180$ . Similar results are also found in MRE and MAPE values. Based on the positive indicator, the R² value of the APCP system is $R_{A P C P}^{2} = 0.965$ , which is higher than the average result of the four well-known individual models $\bar{R^{2}} = (R_{B i l s t m}^{2} + R_{G r u}^{2} + R_{G p}^{2} + R_{L s s v m}^{2}) / 4 = (0.947 + 0.943 + 0.832 + 0.950) / 4 = 0.918$ . From the evaluation index of the 2-step prediction, the MRE value of the APCP system $M R E_{A P C P}^{G z - 2 - s t e p} = - 0.014$ is even better than that of the Gru and GP models in the 1-step prediction ( $M R E_{G r u}^{G z - 1 - s t e p} = 0.029$ and $M R E_{G p}^{G z - 1 - s t e p} = 0.057$ ). Besides, the performance of the 3-step prediction in the APCP system $M A P E_{A P C P}^{G z - 3 - s t e p} = 9.710 %$ has little difference from that of the 2-step prediction $M A P E_{A P C P}^{G z - 2 - s t e p} = 9.552 %$ , which indicates that the proposed system has better stability.

(b) For the dataset in Sh, although the overall performance of the APCP system is not as good as that of the dataset in Gz, it is still superior to the other individual models. For the 1-step prediction, the advantage of the APCP system is not outstanding $\vec{R^{2}} = [R_{A P C P}^{2}, R_{C n n}^{2}, R_{L s t m}^{2}, R_{E l m}^{2}, R_{E n n}^{2}] =$ $[0.976, 0.973, 0.975, 0.974, 0.974]$ . Because the individual submodels show good prediction capacity, the improvement space of the APCP system is limited. However, with the increase in prediction steps, the superiority of the APCP system becomes more and more obvious. Based on the evaluation results of MSE, the APCP system is $M S E_{E l m}^{S h - 2 - s t e p} -$ $M S E_{A P C P}^{S h - 2 - s t e p} =$ $159.395 - 143.385 = 16.010$ lower than the Elm model, which has the optimal result among all individual models, in the 2-step prediction. In the 3-step prediction, the APCP system is $M S E_{C n n}^{S h - 3 - s t e p} - M S E_{A P C P}^{S h - 3 - s t e p} =$ $160.774 - 147 . 137 = 13.637$ less than the Cnn model with the best results. It is worth noting that the MAPE of the APCP system in the 3-step $M A P E_{A P C P}^{S h - 3 - s t e p} = 14.994 %$ is less than that in the 2-step $M A P E_{A P C P}^{S h - 2 - s t e p} = 15.477 %$ .

(c) For the dataset in Cd, the R² value is the largest, which indicates that historical data has the strongest ability to interpret the predicted value at the next time point. For example, in the 3-step prediction of the APCP system, R² in Cd $R_{A P C P}^{2} =$ $0.952 (C d - 3 - s t e p)$ is much higher than that in Gz $R_{A P C P}^{2} = 0.854 (G z -$ $3 - s t e p)$ and Sh $R_{A P C P}^{2} = 0.894 (S h - 3 - s t e p)$ . Due to the frequent fluctuations of the dataset in Cd, the APCP system still achieves exact prediction in the case that the measurement errors of the models are large. Compared with the average value of MSE in four well-known models $\bar{M S E^{C d - 2 - s t e p}} = (M S E_{B i l s t m}^{C d - 2 - s t e p} + M S E_{G r u}^{C d - 2 - s t e p} + M S E_{G p}^{C d - 2 - s t e p} + M S E_{L s s v m}^{C d - 2 - s t e p}) / 4 =$ $(203.926 + 520.354 + 368.929 + 183.649) / 4 = 319.2145$ , the MSE of the APCP system is only $M S E_{A P C P}^{C d - 2 - s t e p} = 157.888$ . For the 3-step prediction results, though the Cnn model performs the best $[M R E_{C n n}^{C d - 3 - s t e p}, M A P E_{C n n}^{C d - 3 - s t e p}] = [0.062,$ $14.993 %]$ of all individual models, it is inferior to the APCP system $[M R E_{A P C P}^{C d - 3 - s t e p},$ $M A P E_{A P C P}^{C d - 3 - s t e p}] = [0.027,$ $13.810 %]$ .

Remark 1.

Compared with all individual models, including Bilstm, Gru, Gp, Lssvm, Cnn, Lstm, Elm, and Enn models, the APCP system has perfect prediction performance. Even in multi-step prediction, the proposed system represents strong robustness as usual.

3.4. Experiment II: Test the Superiority of the APCP System

Based on the results of Experiment I, Experiment II mainly analyzes the percentage improvement of the APCP system compared with four well-known models, which are listed in Table 7. The influence of the magnitude is eliminated, and the superiority of the APCP system can be objectively evaluated. In addition, the average percentage improvement of the three regions is also calculated to illustrate the generality of the APCP system performance.

(a) For the dataset in Gz, the performance of the APCP system is greatly improved in the 1-step and 3-step prediction, while it is relatively small in the 2-step prediction. For example, the MRE value of the APCP system increases by at least $ℑ_{m r e (v s . L s s v m)}^{G z - 1 - s t e p} = 98.38 %$ in one-step prediction. In contrast, in the two-step prediction, it improves by $ℑ_{m r e (v s . G p)}^{G z - 1 - s t e p} = 82.07 %$ at most. The APCP system also has a significant advantage in the 3-step prediction, with MSE ranging from $ℑ_{m s e (v s . G p)}^{G z - 3 - s t e p} = 58.46 %$ to $ℑ_{m s e (v s . G r u)}^{G z - 3 - s t e p} = 76.50 %$ , which is also better than the 2-step prediction $[ℑ_{m s e (v s . G r u)}^{G z - 2 - s t e p},$ $ℑ_{m s e (v s . G p)}^{G z - 2 - s t e p}] =$ $[13.91 %, 51.58 %]$ .

(b) For the dataset in Sh, the improvement of the APCP system for some models is even higher than that of the dataset in Gz. From the results of Experiment I, the fitting result of the dataset in Gz is significantly better than that of the dataset in Sh. It may be because the dataset in Sh is noisier. However, from the perspective of improvement effect, the APCP system can also exactly predict redundant data. Taking 2-step and 3-step predictions as examples, the values of MSE and MRE in the APCP system increase by at least $ℑ_{m s e (v s . B i l s t m)}^{S h - 2 - s t e p} = 21.90 %$ and $ℑ_{m r e (v s . B i l s t m)}^{S h - 3 - s t e p} = 81.84 %$ , respectively, while for the dataset in Gz, the minimum values of $ℑ_{m s e}^{G z - 2 - s t e p}$ and $ℑ_{m r e}^{G z - 3 - s t e p}$ are only $ℑ_{m s e (v s . G r u)}^{G z - 2 - s t e p} = 13.91 %$ and $ℑ_{m r e (v s . L s s v m)}^{G z - 3 - s t e p} = 76.44 %$ respectively.

(c) For the dataset in Cd, with the increase of prediction steps, the performance advantage of the APCP system becomes more and more significant. From the comparison of MSE results, the percentage of improvement of the APCP system gradually increases $[ℑ_{m s e (v s . B i l s t m)}^{C d - 1 - s t e p}, ℑ_{m s e (v s . B i l s t m)}^{C d - 2 - s t e p}, ℑ_{m s e (v s . B i l s t m)}^{C d - 3 - s t e p}] = [17.90 %, 22.58 %,$ $67.41 %]$ . More prediction steps increase the complexity so that the well-known individual model is difficult to maintain a stable prediction, which may cause the percentage change in the APCP system. Similarly, the same conclusion can be drawn in $ℑ_{m r e}$ and $ℑ_{m a p e}$ .

Remark 2.

In the comparative analysis with the above individual model, the superiority of the APCP system is scientifically and systematically verified. Even though there are slight differences between different regions, the average effect is still outstanding.

3.5. Experiment III: Comparison of Different Module Strategies

Experiment III mainly compares the combined models using different module strategies. Specifically, by changing the strategies of the time series reconstruction module and weight search module, the effectiveness of the DE mode and weight search mechanism in the APCP system is proved. The experimental results of the combined model are depicted in Table 8 and Figure 3. To ensure the comparability of the experiment, the settings of all model parameters remain the same as before, as listed in Table 3.

(a) For the dataset in Gz, the time series reconstruction module strategy applied by the APCP system achieves the best prediction effect. The above experiment shows that when the number of prediction steps increases, the prediction result of the individual model is poor. In order to reduce the difficulty of the prediction model, different time series reconstruction strategies are applied to the experiment, which is the empirical mode decomposition (Emd) and ensemble Emd (Eemd). It can be seen that the Eemd strategy does not produce the expected effect $[M S E_{E e m d - W s}^{G z - 1 - s t e p}, M S E_{E e m d - W s}^{G z - 2 - s t e p}, M S E_{E e m d - W s}^{G z - 3 - s t e p}] = [83.250, 194.027, 215.869]$ , while the Emd strategy helps to improve the prediction accuracy $[M S E_{E m d - W s}^{G z - 1 - s t e p}, M S E_{E m d - W s}^{G z - 2 - s t e p},$ $M S E_{E m d - W s}^{G z - 3 - s t e p}] = [47.653, 108.801, 137.224]$ . Although the R² value at the 3-step prediction $R_{E m d - W s}^{2} = 0.901 (G z - 3 - s t e p)$ is higher than that of the APCP system $R_{A P C P}^{2} = 0.854 (G z - 3 - s t e p)$ , it is not as perfect as the APCP system at all other times $[M S E_{A P C P}^{G z - 1 - s t e p}, M S E_{A P C P}^{G z - 2 - s t e p}, M S E_{A P C P}^{G z - 3 - s t e p}] = [20.861, 76.568, 85.532]$ .

(b) For the dataset in Sh, based on the same DE mode, the weight search mechanism of the APCP system can contain the advantages of all individual models as much as possible. Considering the function of the multi-objective optimization mechanism, Moda and Mogwo optimization mechanism is utilized to evaluate the weight search capacity. The effects of these two optimization mechanisms are basically the same $[M A P E_{D E - M o d a}^{S h - 1 - s t e p}, M A P E_{D E - M o g w o}^{S h - 1 - s t e p}] = [12.558 %, 12.356 %]$ , except that they are better $M R E_{D E - M o g w o}^{S h - 2 - s t e p} = 0 . 005$ than the APCP system $M R E_{A P C P}^{S h - 2 - s t e p} = - 0 . 025$ in individual indicators, and the prediction performance is far inferior to the APCP system $[M A P E_{A P C P}^{S h - 1 - s t e p}, M A P E_{A P C P}^{S h - 2 - s t e p}, M A P E_{A P C P}^{S h - 3 - s t e p}] = [9.139 %, 15.477 %, 14.994 %]$ .

(c) For the dataset in Cd, no matter how the module strategies change, the APCP system still maintains an absolute advantage. Taking the result of the 1-step prediction as an example, the MSE value of the APCP system is reduced by $(M S E_{D E - M o d a}^{C d - 1 - s t e p} - M S E_{A P C P}^{C d - 1 - s t e p}) / M S E_{D E - M o d a}^{C d - 1 - s t e p} = (138.789 - 49.914) / 138.789 \times 100 % = 64.036 %$ to that of the DE-Moda model. Compared with the MRE value of the DE-Mogwo model $M R E_{D E - M o g w o}^{C d - 1 - s t e p} = 0.048$ , the APCP system is only $M R E_{A P C P}^{C d - 1 - s t e p} = 0.012$ . Similarly, the MAPE value of the APCP system is $M A P E_{E m d - W s}^{C d - 1 - s t e p} - M A P E_{A P C P}^{C d - 1 - s t e p} =$ $10.261 % - 7.560 % = 2.701 %$ lower than that of the Emd-Ws model. The R² value reaches the maximum $[R_{A P C P}^{S}, R_{D E - M o d a}^{2}, R_{D E - M o g w o}^{2}, R_{E e m d - W s}^{2}, R_{E m d - W s}^{2}] =$ $[0.985, 0.959, 0.957, 0.960, 0.977]$ .

Remark 3.

The DE mode and weight search mechanism adopted by the APCP system can minimize the complexity of input data and integrate the advantages of the individual models, which is the key to improving the prediction performance of the APCP system.

4. Discussions

In this section, three discussions, which are significance analysis, correlation analysis, and sensitivity analysis, are implemented to more comprehensively and scientifically demonstrate the perfect performance of the APCP system.

4.1. Significance Analysis

Significance analysis aims to explain whether the difference between the APCP system and the model involved in Experiment I~III is significant. Diebold-Mariano (DM) statistic $ϒ^{D M}$ is a general indicator to measure the consistency of two sequences [53]. Based on this $ϒ^{D M}$ , this subsection compares the results between the APCP system and eight models, and the results are represented in Table 9.

Compared with the four well-known individual models, the prediction results of the APCP system have significant advantages $[ϒ_{B i l s t m (G z)}^{D M (2 - s t e p)}, ϒ_{B i l s t m (S h)}^{D M (2 - s t e p)}, ϒ_{B i l s t m (C d)}^{D M (2 - s t e p)}] =$ $[1.76, 1.77, 1.66]$ at the significance level of $α = 0.10$ , except that there is no significant difference between the Gru $ϒ_{G r u (G z)}^{D M (2 - s t e p)} = 0.30$ and Lssvm $[ϒ_{L s s v m (S h)}^{D M (2 - s t e p)}, ϒ_{L s s v m (C d)}^{D M (2 - s t e p)}] =$ $[1.59, 1 . 22]$ models and the APCP system in the 2-step prediction results. For the combined models with different module strategies, although the Emd-Ws model accepts $H_{0} : E [H (φ_{ι} - {\hat{φ}}_{ι}^{'})] = E [H (φ_{ι} - {\hat{φ}}_{ι}^{″})]$ in the 2-step and 3-step prediction results of the Sh dataset $[ϒ_{E m d - W s (S h)}^{D M (2 - s t e p)}, ϒ_{E m d - W s (S h)}^{D M (2 - s t e p)}] = [- 1.04, - 0.52]$ , in other cases (DE-Moda, DE-Mogwo, and Eemd-Ws models), $H_{0} : E [H (φ_{ι} - {\hat{φ}}_{ι}^{'})] = E [H (φ_{ι} - {\hat{φ}}_{ι}^{″})]$ is rejected at the significance level of $α = 0.05$ , such as $ϒ_{D E - M o d a (S h)}^{D M (1 - s t e p)} = 2.72$ . In other words, it is significantly different from the APCP system. In summary, the APCP system has significant advantages in improving prediction performance.

4.2. Correlation Analysis

This subsection mainly adopts the method of grey relational degree analysis [54] to calculate the correlation $ς_{υ}$ between the prediction results ${\hat{φ}}_{τ}^{υ}$ of all models $υ$ and the real values $φ_{τ}$ . The comparison results are revealed in Table 10.

In the correlation analysis of all models, the APCP system has the strongest correlation $[ς_{A P C P}^{G z - 1 - s t e p}, ς_{A P C P}^{G z - 2 - s t e p}, ς_{A P C P}^{G z - 3 - s t e p}] = [0.875, 0.847, 0.883]$ with the real value $φ_{τ}$ . The predicted value of the APCP system not only does not decrease with the increase of prediction steps but also improves $[ς_{A P C P}^{S h - 1 - s t e p}, ς_{A P C P}^{S h - 2 - s t e p}, ς_{A P C P}^{S h - 3 - s t e p}] = [0.888, 0.857, 0.876]$ , which also confirms the stability of APCP system performance. For example, for the dataset in Cd, the $ς_{A P C P}^{C d - 1 - s t e p}$ value of the 1-step prediction is $ς_{A P C P}^{C d - 1 - s t e p} = 0.840$ , while the $ς_{A P C P}^{C d - 3 - s t e p}$ value of the 3-step prediction is $ς_{A P C P}^{C d - 1 - s t e p} = 0.850$ .

4.3. Sensitivity Analysis

In order to verify the generality of the APCP system, this section discusses the stability of the weight search mechanism. Three important parameters, including iteration number $\vec{I T e r} = [I T e r^{1}, I T e r^{2}, I T e r^{3}, I T e r^{4}, I T e r^{5}] = [300, 350, 400, 450, 500]$ , population number $\vec{α} = [α^{1}, α^{2}, α^{3}, α^{4}, α^{5}] =$ $[20, 40, 60,$ $80, 100]$ , and archive size $\vec{A R c h I v e_{\max}} = [A R c h I v e_{\max}^{1}, A R c h I v e_{\max}^{2}, A R c h I v e_{\max}^{3}, A R c h I v e_{\max}^{4}, A R c h I v e_{\max}^{5}] = [100, 200, 300,$ $400, 500]$ , are selected to evaluate changes in APCP system performance. The results of the discussion are depicted in Table 11 and Figure 4.

(a) The performance of the APCP system can not fluctuate greatly with the change in ITer. The sensitivity indicators ${\bar{χ}}_{M R E^{I T e r}}$ and ${\bar{χ}}_{R^{2 (I T e r)}}$ show the volatility of MRE and R² values in the APCP system when ITer changes. From the results of the three cities, the ${\bar{χ}}_{M R E^{I T e r}}$ and ${\bar{χ}}_{R^{2 (I T e r)}}$ values remain unchanged, which are close to 0.

(b) The change in $α$ has no obvious influence on the evaluation index of the APCP system. Although ${\bar{χ}}_{M S E^{α}}^{C d - 2 - s t e p} = 6.765$ fluctuate significantly in the 2-step prediction, this is only an individual case. It may be caused by insufficient times of experiments. In most cases, the four evaluation indicators of the APCP system reflect good stability $[{\bar{χ}}_{M S E^{α}}^{G z - 1 - s t e p}, {\bar{χ}}_{M R E^{α}}^{G z - 1 - s t e p}, {\bar{χ}}_{M A P E^{α}}^{G z - 1 - s t e p}, {\bar{χ}}_{R^{2 (α)}}^{G z - 1 - s t e p}] = [0.070, 0.000, 0.010, 0.000]$ .

(c) From the adjustment of ARchIve_max, the stability of the APCP system can also be verified. Except for ${\bar{χ}}_{M S E^{A R c h I v e_{\max}}}^{C d - 2 - s t e p} = 6.412$ in the two-step prediction of the dataset in Cd, the error evaluation index of the APCP system has no significant fluctuation. Even the results in the 3-step prediction emerge with superior stability $[{\bar{χ}}_{M S E^{A R c h I v e_{\max}}}^{C d - 3 - s t e p}, {\bar{χ}}_{M R E^{A R c h I v e_{\max}}}^{C d - 3 - s t e p}, {\bar{χ}}_{M A P E^{A R c h I v e_{\max}}}^{C d - 3 - s t e p}, {\bar{χ}}_{R^{2 (A R c h I v e_{\max})}}^{C d - 3 - s t e p}] = [1.671, 0.001, 0.062, 0.001]$ .

Remark 4.

The parameter adjustment of the weight search mechanism has no significant impact on the APCP system, which demonstrates that the system has excellent stability.

5. Conclusions and Prospect

With the comprehensive popularization of urbanization and the rapid increase in energy consumption, many air pollution problems have been brought to cities. To formulate protective measures timely and effectively, a novel APCP system is proposed for environmental system management in this paper. First, based on DE mode, the AP sequence is reconstructed to reduce redundant interference. Then, four individual models with superior performance are selected to realize AP prediction. Next, the weight search mechanism is designed to balance the precision and robustness of the proposed system. Finally, appropriate weights are adopted to integrate the advantages of all models, which can obtain a perfect prediction value.

From Experiment I~III, this conclusion can be drawn that the multi-step prediction performance of the APCP system is superior. Taking the dataset in Gz as an example, in the 1-step prediction, the MSE of the APCP system has a minimum value $M S E_{A P C P}^{G z - 1 - s t e p} = 20.861$ . Compared with the Lssvm model, the value of MRE is reduced by $ℑ_{m r e (v s . L s s v m)}^{G z - 1 - s t e p} = 98.38 %$ . For the results of the 2-step prediction, the value of MAPE is $M A P E_{E m d - W s}^{G z - 2 - s t e p} - M A P E_{A P C P}^{G z - 2 - s t e p} = 1.441 %$ less than that of the Emd-Ws model. It is worth noting that the R² value predicted by the APCP system in 3-step is even better than that predicted by the DE-Moda model in 2-step $[R_{A P C P}^{S (G z - 3 - s t e p)}, R_{D E - M o d a}^{S (G z - 2 - s t e p)}] =$ $[0.854, 0.668]$ . To sum up, the comprehensive capacity of the APCP system is outstanding, which can be used as an effective tool in actual AP prediction.

Although this paper assumes that the AP concentration sequence is only related to historical values, many external factors, such as wind speed, affect the concentration changes of AP in real life. Based on this consideration, the APCP system can be further expanded in future research to improve its prediction performance.

Author Contributions

Y.H.: Conceptualization, software, writing—original draft preparation. Y.Z.: methodology, visualization, writing—reviewing and editing. J.G.: writing—original draft preparation, software validation. J.W.: formal analysis, software validation. All authors have read and agreed to the published version of the manuscript.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

The authors would like to thank the contribution of the anonymous reviewers and the editors.

Conflicts of Interest

The authors declare no conflict of interest.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Figures and Tables

Figure 1. The flowchart of the APCP system.

View Image - Figure 2. The prediction results of the APCP system and four well-known models in Experiment I. The three-line charts represent the fitting results of the 1-step prediction in the three regions. The bar chart shows the difference in MSE values for the dataset in Gz. The radar chart describes the changes in MAPE values in well-known models and the APCP system for the dataset in Sh. The horizontal bar chart depicts the comparison results of MRE values in different models for the dataset in Cd.

Figure 2. The prediction results of the APCP system and four well-known models in Experiment I. The three-line charts represent the fitting results of the 1-step prediction in the three regions. The bar chart shows the difference in MSE values for the dataset in Gz. The radar chart describes the changes in MAPE values in well-known models and the APCP system for the dataset in Sh. The horizontal bar chart depicts the comparison results of MRE values in different models for the dataset in Cd.

View Image - Figure 3. Performance comparison of different combined models for the dataset in Gz. The six subfigures on the left side of the dotted line indicate how close the predicted results of all models in Experiment III are to the real values. The three subfigures on the right compare the differences in MAPE, R2, and MSE values between the APCP system and the other combined models.

Figure 3. Performance comparison of different combined models for the dataset in Gz. The six subfigures on the left side of the dotted line indicate how close the predicted results of all models in Experiment III are to the real values. The three subfigures on the right compare the differences in MAPE, R2, and MSE values between the APCP system and the other combined models.

Figure 4. Sensitivity assessment of average results in three cities.

Table 1

Evaluation of data-driven prediction models for AP concentration.

Models	Ref.	Dataset	Conclusions	Strengths	Limitations
GAM	[41]	PM_2.5 in Beijing	The lag order and climatic conditions have the most significant influence on the change in PM_2.5 concentration.	The GAM model intuitively explains the reasons for the change and diffusion of PM_2.5 concentration.	The prediction accuracy of this model is limited.
Markov chain model	[42]	API in Malaysia	Markov chain model can be used as an effective tool in haze pollution prediction.	The model is simple in structure and easy to operate.	The higher-order extended form of Markov chain is not considered.
SNgbn (1,1) model	[43]	AQI, PM₁₀, PM_2.5, SO₂, NO₂, CO, and O₃ in the Yangtze River Delta	For data with seasonal periodic fluctuations, the model provides stable prediction results.	The SNgbn (1,1) model simulates the seasonal characteristics of APs to a great extent.	External factors are not added to the model.
3D-CBLstm	[44]	PM_2.5 in Beijing	The application of clustering analysis and feature selection strategy is conducive to the improvement of the prediction effect.	The 3D-CBLstm model not only realizes the efficient extraction of important features but also considers the long-term correlation in the sequence.	The selection of prediction model parameters is subjective.
XGBoost-Garch-MLP	[45]	PM_2.5 in Shanxi	This model can effectively predict the fluctuation range of PM_2.5 concentration, which is helpful to identify the moving direction of PM_2.5.	The quality of input data is improved based on feature selection and four Garch extended models comprehensively cover the fluctuation interval of PM_2.5 concentration.	The selection of input variables and prediction models needs to be further optimized.
CWfm	[46]	PM₁₀, PM_2.5, NO₂, SO₂, O₃, and CO in Beijing	The CWfm model is scientific and efficient for predicting the concentration of APs.	The proposed combined model has better fitting results than its submodel.	The influencing factors considered are not comprehensive enough.
Dcnn	[47]	Meteorological and AP (NO_X and O₃) data in Texas	Compared with the deterministic models and linear models, the prediction results of this model are significantly improved.	Predictions can be successfully achieved even when there are fewer input dimensions.	The model has poor accuracy in estimating extreme values.
CEemd-CRJ-MLR model	[48]	Meteorological and AP (NO₂, CO, and O₃) data in Beijing	The improved CRJ model is effectively applied to the prediction of AP concentration, and the prediction performance of the hybrid model is improved.	The hybrid model also has accurate results for the long-term prediction of the concentration of APs.	The structural design of the model is complex, which reduces the universality.
Pso-Svm model	[49]	Meteorological and AP (AQI, PM₁₀, PM_2.5, SO₂, NO₂, CO, and O₃) data in Beijing	The hybrid model is superior to the benchmark models in both fitting accuracy and simulation speed.	As the amount of data is reduced, the running time of the model is shortened.	The influence of holidays, seasons, and other relevant information is not included.
Emd-Gru model	[50]	PM_2.5 in Beijing	Compared with the Gru model, the proposed combined model shows the best results in all error measurement indicators.	The problem of time lag is perfectly solved.	The fitting results of different spaces are lacking, and the regional versatility of the model is limited.
Combined model based on the L₁ norm	[51]	PM₁₀, PM_2.5, SO₂, NO₂, CO, and O₃ in Baoding, Tianjin, and Shijiazhuang	The proposed model can accurately evaluate future air quality and has broad application prospects.	The model parameters are adjusted based on the optimization algorithm, which enhances the scientificity and feasibility.	The model structure is complex.

Table 2

Comparison results of pre-experiment.

Models	Guangzhou				Shanghai				Chengdu				Average
Models	MSE	MRE	MAPE	R²	MSE	MRE	MAPE	R²	MSE	MRE	MAPE	R²	MSE	MRE	MAPE	R²
Cnn	25.477	0.009	5.465%	0.958	43.353	0.039	9.564%	0.973	55.457	0.034	8.224%	0.983	41.429	0.027	7.751%	0.971
Bilstm	31.704	0.014	6.119%	0.947	47.755	0.028	10.225%	0.970	60.800	0.041	8.414%	0.982	46.753	0.028	8.252%	0.966
Lssvm	29.878	0.013	5.752%	0.950	51.454	0.058	10.610%	0.968	61.730	0.049	9.123%	0.982	47.687	0.040	8.495%	0.967
Gp	100.916	0.057	12.038%	0.832	270.369	0.352	39.540%	0.830	156.823	0.175	20.018%	0.953	176.036	0.195	23.865%	0.872
Lstm	22.180	0.011	5.043%	0.963	39.016	0.030	9.876%	0.975	51.518	0.031	7.970%	0.985	37.571	0.024	7.630%	0.974
Gru	34.112	0.029	7.025%	0.943	68.920	0.120	17.239%	0.957	145.464	0.132	17.194%	0.957	82.832	0.094	13.819%	0.952
Elm	23.160	0.012	5.282%	0.962	41.545	0.030	9.563%	0.974	52.633	0.033	8.117%	0.984	39.113	0.025	7.654%	0.973
Enn	23.465	0.012	5.330%	0.961	41.920	0.035	9.796%	0.974	53.782	0.036	8.320%	0.984	39.722	0.028	7.815%	0.973

Note: This table reports the results of submodel simulations, the assessment metrics employed in this simulation include MSE, MAPE, MRE and R². Bold numbers indicate that the average simulation results in all submodels are better.

Table 3

The explanation and the corresponding value of the APCP system.

Systems	Symbol	Explanation	Value	Systems	Symbol	Explanation	Value
Bilstm	$m_{ε}$	Max epochs number	400	Gp	$Gl$	Gaussian likelihood	−1
Bilstm	$η_{ν}$	Hidden layer node numbers	20	Gp	$ι_{ν}$	Input layer node number	4
BPnn, Elm, Enn	$ι_{ν}$	Input layer node numbers	4	Lstm	$ε_{τ}$	Epochs of training	500
	$ο_{ν}$	Output layer node numbers	1	Emd	$S_{r}$	Stopping rule of sifting	wave
	$η_{ν}$	Hidden layer node numbers	20	Emd	$B_{d}$	Boundary	type 5
Cnn	$ν_{κ}$	Number of kernels in convolutional layer	3	Eemd	$Ν_{σ τ δ}$	Signal-to-noise ratio	0.1
	$σ_{κ}$	Kernel size of the convolutional layer	40	DE	${\tilde{M}}_{I}$	Maximum iteration number	500
	$h_{n}$	Hidden layer node numbers	[384,384]		$Ν_{ε}$	Number of noise additions	50
Gru	$m_{ε}$	Max epochs number	2000		$Ν_{σ τ δ}$	Signal-to-noise ratio	0.1
Gru	$μ_{β σ}$	Mini batch size	256	Ws, Moda, Mogwo	${\tilde{M}}_{I}$	Maximum iteration number	500
Lssvm	$Κ_{ϕ}$	Kernel function parameter	5		${\tilde{S}}_{A}$	Archive size	400
Lssvm	$Χ_{λ}$	Penalty parameter	5		${\tilde{C}}_{N}$	Chameleon number	60

Note: Mogwo means the multi-objective grey wolf optimization.

Table 4

The statistical assessments of the dataset.

Datasets	No.	Max.	Min.	Mean	Std.	Mlye
Guangzhou
Total	744	176	8	34.16	69.61	0.23
Train	595	176	8	35.79	69.79	0.17
Test	149	142	31	26.77	68.89	0.37
Shanghai
Total	744	255	11	55.56	84.04	0.31
Train	595	255	11	55.99	90.41	0.28
Test	149	203	13	45.86	58.60	0.04
Chengdu
Total	744	335	15	64.71	141.84	0.27
Train	595	335	15	59.47	154.84	0.28
Test	149	233	23	58.59	89.91	0.14

Note: This table summarizes the main statistical information of three datasets, and the mathematical formulas of Mean, Std. and Mlye are $M e a n = (1 / ℤ) \sum {\tilde{O}}_{σ}$ , $S t d . = \sqrt{\sum {({\tilde{O}}_{σ} - {\bar{O}}_{σ})}^{2} / (ℤ - 1)}$ , and $M l y e = (1 / (T_{l} - T_{0})) \sum l o g_{2} (δ_{ι} / δ_{ι - 1})$ , respectively. Besides, the unit of PM_2.5 concentration data is $μ g / m^{3}$ .

Table 5

The mathematical formula of four evaluation metrics.

Metrics	Mathematical Formula
Mean Absolute Percentage Error	$M A P E = (1 / ℤ) \sum (\|{\hat{φ}}_{ι} - φ_{ι}\| / φ_{ι}) \times 100 %$
Mean Relative Error	$M R E = (1 / ℤ) \sum (({\hat{φ}}_{ι} - φ_{ι}) / φ_{ι})$
Mean Squared Error	$M S E = (1 / ℤ) {\sum (φ_{ι} - {\hat{φ}}_{ι})}^{2}$
R-squared score	$R^{2} = 1 - \sum {({\hat{φ}}_{ι} - φ_{ι})}^{2} / \sum {(φ_{ι} - {\bar{φ}}_{ι})}^{2}$

Note: $φ_{ι}$ is the $ι$ -th actual value of the AP concentration series, and ${\hat{φ}}_{ι}$ is the $ι$ -th output result of the APCP system.

Table 6

Comparison results with the individual models.

Models	City	Guangzhou				Shanghai				Chengdu
Models	City	MSE	MRE	MAPE	R²	MSE	MRE	MAPE	R²	MSE	MRE	MAPE	R²
APCP	1-step	20.861	0.000	4.883%	0.965	38.445	0.006	9.139%	0.976	49.914	0.012	7.560%	0.985
	2-step	76.568	−0.014	9.552%	0.871	143.385	−0.025	15.477%	0.903	157.888	0.021	13.515%	0.953
	3-step	85.532	−0.014	9.710%	0.854	147.137	−0.028	14.994%	0.894	158.789	0.027	13.810%	0.952
Bilstm	1-step	31.704	0.014	6.119%	0.947	47.755	0.028	10.225%	0.970	60.800	0.041	8.414%	0.982
	2-step	97.906	0.031	10.704%	0.835	183.587	0.078	20.620%	0.876	203.926	0.110	16.432%	0.939
	3-step	209.020	0.060	16.410%	0.644	416.976	0.156	31.115%	0.699	487.225	0.217	27.335%	0.853
Gru	1-step	34.112	0.029	7.025%	0.943	68.920	0.120	17.239%	0.957	145.464	0.132	17.194%	0.957
	2-step	88.941	0.046	11.036%	0.850	222.910	0.277	34.932%	0.849	520.354	0.300	34.546%	0.844
	3-step	364.031	0.064	19.808%	0.381	585.365	0.511	59.411%	0.577	1425.353	0.570	61.621%	0.571
Gp	1-step	100.916	0.057	12.038%	0.832	270.369	0.352	39.540%	0.830	156.823	0.175	20.018%	0.953
	2-step	158.148	0.078	15.547%	0.733	465.399	0.489	54.192%	0.685	368.929	0.278	31.152%	0.889
	3-step	205.923	0.098	18.482%	0.650	681.216	0.632	69.090%	0.508	640.693	0.373	41.128%	0.807
Lssvm	1-step	29.878	0.013	5.752%	0.950	51.454	0.058	10.610%	0.968	61.730	0.049	9.123%	0.982
	2-step	101.226	0.031	10.821%	0.829	185.359	0.131	20.207%	0.874	183.649	0.101	16.178%	0.945
	3-step	209.427	0.058	16.510%	0.644	408.764	0.219	29.946%	0.705	371.823	0.160	23.578%	0.888
Cnn	1-step	25.477	0.009	5.465%	0.958	43.353	0.039	9.564%	0.973	55.457	0.034	8.224%	0.983
	2-step	95.182	0.022	10.614%	0.839	165.475	0.091	17.971%	0.888	173.737	0.092	15.444%	0.948
	3-step	94.856	0.023	10.537%	0.839	160.774	0.081	18.188%	0.884	166.672	0.062	14.993%	0.950
Lstm	1-step	22.180	0.011	5.043%	0.963	39.016	0.030	9.876%	0.975	51.518	0.031	7.970%	0.985
	2-step	92.552	0.019	10.346%	0.844	179.377	0.066	23.206%	0.878	180.829	0.071	15.373%	0.946
	3-step	94.432	0.020	10.303%	0.839	175.869	0.069	22.962%	0.873	181.508	0.066	15.209%	0.945
Elm	1-step	23.160	0.012	5.282%	0.962	41.545	0.030	9.563%	0.974	52.633	0.033	8.117%	0.984
	2-step	86.027	0.029	10.312%	0.855	159.395	0.081	18.464%	0.892	166.130	0.087	15.212%	0.950
	3-step	92.910	0.018	10.404%	0.842	165.485	0.061	19.901%	0.880	176.767	0.064	15.206%	0.947
Enn	1-step	23.465	0.012	5.330%	0.961	41.920	0.035	9.796%	0.974	53.782	0.036	8.320%	0.984
	2-step	87.895	0.031	10.345%	0.852	162.584	0.096	19.111%	0.890	167.932	0.094	15.414%	0.950
	3-step	93.483	0.019	10.452%	0.841	169.585	0.069	20.610%	0.877	180.407	0.066	15.460%	0.946

Table 7

The improvement percentage of the APCP system.

City		Guangzhou			Shanghai			Chengdu			Average
City		$ℑ_{m s e}$	$ℑ_{m r e}$	$ℑ_{m a p e}$	$ℑ_{m s e}$	$ℑ_{m r e}$	$ℑ_{m a p e}$	$ℑ_{m s e}$	$ℑ_{m r e}$	$ℑ_{m a p e}$	$ℑ_{m s e}$	$ℑ_{m r e}$	$ℑ_{m a p e}$
APCP vs. Bilstm	1-step	34.20%	98.40%	20.20%	19.50%	79.32%	10.62%	17.90%	71.02%	10.16%	23.87%	82.91%	13.66%
	2-step	21.79%	55.72%	10.77%	21.90%	67.66%	24.94%	22.58%	80.96%	17.75%	22.09%	68.11%	17.82%
	3-step	59.08%	76.97%	40.83%	64.71%	81.84%	51.81%	67.41%	87.61%	49.48%	63.73%	82.14%	47.37%
APCP vs. Gru	1-step	38.85%	99.23%	30.49%	44.22%	95.16%	46.99%	65.69%	90.98%	56.03%	49.58%	95.12%	44.50%
	2-step	13.91%	69.42%	13.45%	35.68%	90.90%	55.69%	69.66%	92.99%	60.88%	39.75%	84.44%	43.34%
	3-step	76.50%	78.41%	50.98%	74.86%	94.44%	74.76%	88.86%	95.28%	77.59%	80.08%	89.38%	67.78%
APCP vs. Gp	1-step	79.33%	99.62%	59.44%	85.78%	98.35%	76.89%	68.17%	93.18%	62.24%	77.76%	97.05%	66.19%
	2-step	51.58%	82.07%	38.56%	69.19%	94.85%	71.44%	57.20%	92.43%	56.62%	59.33%	89.78%	55.54%
	3-step	58.46%	86.04%	47.46%	78.40%	95.51%	78.30%	75.22%	92.79%	66.42%	70.69%	91.45%	64.06%
APCP vs. Lssvm	1-step	30.18%	98.38%	15.11%	25.28%	90.04%	13.87%	19.14%	75.60%	17.14%	24.87%	88.01%	15.37%
	2-step	24.36%	55.64%	11.73%	22.64%	80.69%	23.41%	14.03%	79.16%	16.46%	20.34%	71.83%	17.20%
	3-step	59.16%	76.44%	41.19%	64.00%	87.06%	49.93%	57.29%	83.17%	41.43%	60.15%	82.22%	44.18%

Note: This table reports the improvement percentage of the APCP system, the assessment metrics employed in this experiment includes MSE, MRE, and MAPE, and the mathematical formulas of the metric $Φ$ can be described as $ℑ_{Φ} = (Φ_{c o m p a r i s o n} - Φ_{A P C P}) / Φ_{c o m p a r i s o n}$ .

Table 8

Error evaluation based on different module strategies.

Models	City	Guangzhou				Shanghai				Chengdu
Models	City	MSE	MRE	MAPE	R²	MSE	MRE	MAPE	R²	MSE	MRE	MAPE	R²
APCP	1-step	20.861	0.000	4.883%	0.965	38.445	0.006	9.139%	0.976	49.914	0.012	7.560%	0.985
	2-step	76.568	−0.014	9.552%	0.871	143.385	−0.025	15.477%	0.903	157.888	0.021	13.515%	0.953
	3-step	85.532	−0.014	9.710%	0.854	147.137	−0.028	14.994%	0.894	158.789	0.027	13.810%	0.952
DE-Moda	1-step	44.299	−0.018	6.346%	0.926	78.536	−0.016	12.558%	0.951	138.789	0.055	12.368%	0.959
	2-step	196.907	−0.062	13.260%	0.668	291.930	−0.040	20.260%	0.802	324.790	0.022	16.480%	0.902
	3-step	236.687	−0.083	14.610%	0.597	322.913	−0.071	20.461%	0.767	388.779	0.179	23.553%	0.883
DE-Mogwo	1-step	45.530	−0.008	6.333%	0.924	80.208	0.007	12.356%	0.949	144.490	0.048	12.091%	0.957
	2-step	190.901	−0.029	13.188%	0.678	303.205	0.005	19.997%	0.795	312.185	0.101	18.881%	0.906
	3-step	255.447	−0.060	14.885%	0.565	316.368	−0.023	19.873%	0.771	478.242	0.007	17.896%	0.856
Eemd-Ws	1-step	83.250	−0.020	8.714%	0.862	125.311	−0.006	13.821%	0.921	135.219	0.048	11.996%	0.960
	2-step	194.027	−0.054	13.224%	0.673	292.060	−0.042	20.282%	0.802	309.633	0.071	17.753%	0.907
	3-step	215.869	−0.052	13.967%	0.633	311.389	−0.043	19.927%	0.775	381.970	0.144	21.816%	0.885
Emd-Ws	1-step	47.653	−0.026	6.535%	0.921	79.500	0.010	12.204%	0.950	76.215	−0.088	10.261%	0.977
	2-step	108.801	−0.067	10.993%	0.816	116.014	−0.009	16.138%	0.921	879.255	−0.381	39.738%	0.736
	3-step	137.224	0.123	20.357%	0.901	162.807	0.011	17.037%	0.882	294.961	−0.132	18.134%	0.911

Note: This table reports the comparison results between the APCP system with other different module strategies, which include DE-Moda, DE-Mogwo, Eemd-Ws, and Emd-Ws, where Ws means the weight search mechanism of the APCP system.

Table 9

Significance comparison results of the APCP model.

Models	Guangzhou			Shanghai			Chengdu
Models	1-Step	2-Step	3-Step	1-Step	2-Step	3-Step	1-Step	2-Step	3-Step
Bilstm	2.62 **	1.76 *	4.21 **	1.87 *	1.77 *	3.70 **	1.96 **	1.66 *	5.04 **
Gru	1.68 *	0.30	3.48 **	3.22 **	2.33 **	6.42 **	7.50 **	8.29 **	11.90 **
Gp	5.15 **	3.45 **	3.88 **	6.99 **	5.90 **	6.96 **	8.98 **	6.22 **	8.20 **
LSsvm	2.83 **	2.02 **	4.22 **	2.02 **	1.59	3.32 **	2.52 **	1.22	4.71 **
DE-Moda	3.35 **	3.28 **	3.87 **	2.72 **	3.57 **	2.72 **	4.68 **	4.22 **	5.04 **
DE-Mogwo	3.39 **	3.26 **	3.92 **	2.64 **	3.19 **	3.07 **	4.64 **	4.63 **	3.57 **
Eemd-Ws	4.49 **	3.22 **	3.11 **	3.75 **	3.60 **	2.96 **	4.62 **	4.50 **	4.54 **
Emd-Ws	3.41 **	1.93 **	4.45 **	2.71 **	−1.04	−0.52	3.60 **	12.53 **	3.05 **

Note: This table gives the results of DM statistics $ϒ^{D M}$ , which can be expressed as: $ϒ^{D M} =$ $\{\sum_{ι = 1}^{ℤ} [H (φ_{ι} - {\hat{φ}}_{ι}^{'}) - H (φ_{ι} - {\hat{φ}}_{ι}^{″})] / ℤ\} / \sqrt{{σ^}^{2} / ℤ}$ , where ${σ^}^{2} = \sum_{ι = 1}^{ℤ} \{[H (φ_{ι} - {\hat{φ}}_{ι}^{'}) - H (φ_{ι} - {\hat{φ}}_{ι}^{″})] -$ ${[\bar{H (φ_{ι} - {\hat{φ}}_{ι}^{'}) - H (φ_{ι} - {\hat{φ}}_{ι}^{″})}]\}}^{2}$ , and $H (\cdot)$ is loss function. Besides, if $ϒ^{D M} ţ 1.65$ (marked with *) or $ϒ^{D M} ţ 1.96$ (marked with **), it means that $H_{0} : E [H (φ_{ι} - {\hat{φ}}_{ι}^{'})] = E [H (φ_{ι} - {\hat{φ}}_{ι}^{″})]$ will be rejected at the significance level of $α = 0.10$ or $α = 0.05$ , respectively.

Table 10

Discussion results of correlation analysis.

Models	Guangzhou			Shanghai			Chengdu
Models	1-Step	2-Step	3-Step	1-Step	2-Step	3-Step	1-Step	2-Step	3-Step
APCP	0.875	0.847	0.883	0.888	0.857	0.876	0.840	0.807	0.850
Bilstm	0.857	0.835	0.826	0.876	0.831	0.802	0.825	0.791	0.768
Gru	0.862	0.845	0.801	0.857	0.791	0.725	0.762	0.702	0.633
Gp	0.757	0.794	0.821	0.711	0.711	0.699	0.721	0.712	0.723
LSsvm	0.856	0.833	0.822	0.876	0.838	0.809	0.820	0.799	0.793
DE-Moda	0.839	0.799	0.831	0.857	0.817	0.836	0.777	0.782	0.786
DE-Mogwo	0.839	0.806	0.831	0.860	0.820	0.842	0.776	0.776	0.810
Eemd-Ws	0.799	0.802	0.839	0.834	0.817	0.840	0.778	0.782	0.805
Emd-Ws	0.837	0.825	0.816	0.859	0.861	0.867	0.794	0.621	0.818

Note: This table shows the correlation between all models and real series. The expression of $ς_{υ}$ is $ς_{υ} = \sum_{τ = 1}^{ℤ} [(\underset{υ}{m i n} \underset{τ}{m i n} |φ_{τ} - {\hat{φ}}_{ι}^{υ}| + μ \underset{υ}{m a x} \underset{τ}{m a x} |φ_{τ} - {\hat{φ}}_{ι}^{υ}|) / (|φ_{τ} - {\hat{φ}}_{ι}^{υ}| + μ \underset{υ}{m a x} \underset{τ}{m a x} |φ_{τ} - {\hat{φ}}_{ι}^{υ}|)] / ℤ$ .

Table 11

Sensitivity analysis results of the APCP system.

City	Guangzhou			Shanghai			Chengdu
City	1-Step	2-Step	3-Step	1-Step	2-Step	3-Step	1-Step	2-Step	3-Step
MSE	0.066	1.632	3.202	0.609	1.856	0.420	0.111	4.351	1.818
MRE	0.000	0.001	0.002	0.002	0.003	0.001	0.001	0.001	0.001
MAPE	0.015	0.057	0.167	0.176	0.344	0.075	0.054	0.126	0.118
R²	0.000	0.003	0.005	0.000	0.001	0.000	0.000	0.001	0.001
MSE	0.070	1.422	3.138	0.461	1.342	0.413	0.322	6.765	2.122
MRE	0.000	0.001	0.002	0.001	0.003	0.001	0.000	0.003	0.001
MAPE	0.010	0.079	0.189	0.135	0.315	0.132	0.046	0.265	0.119
R²	0.000	0.002	0.005	0.000	0.001	0.000	0.000	0.002	0.001
MSE	0.036	1.839	3.587	0.660	1.678	1.045	0.816	6.412	1.671
MRE	0.000	0.001	0.002	0.001	0.003	0.001	0.000	0.003	0.001
MAPE	0.009	0.092	0.174	0.140	0.282	0.111	0.048	0.300	0.062
R²	0.000	0.003	0.006	0.000	0.001	0.001	0.000	0.002	0.001

Note: When one parameter is changed, the other parameters retain the default values. Sensitivity indicator is $χ_{I^{P}} = \sum_{a = 1}^{Q} {(I^{P_{a}} - \bar{I^{P}})}^{2} / Q$ , where $Q = 5$ indicates the number of parameter P changes, $I^{P_{a}}$ represents the error indicator $I$ of the a-th parameter P, and $\bar{I^{P}}$ is the mean of $I^{P_{a}}$ . Besides, the measurement results ${\bar{χ}}_{I^{P}} = \sum_{i = 1}^{10} χ_{I_{i}^{P}} / 10$ in this table are the average values of 10 operations.

References

1. Liu, H.Y.; Dunea, D.; Iordache, S.; Pohoata, A. A Review of Airborne Particulate Matter Effects on Young Children’s Respiratory Symptoms and Diseases. Atmosphere; 2018; 9, 150. [DOI: https://dx.doi.org/10.3390/atmos9040150]

2. Yang, S.; Fang, D.; Chen, B. Human Health Impact and Economic Effect for PM_2.5 Exposure in Typical Cities. Appl. Energy; 2019; 249, pp. 316-325. [DOI: https://dx.doi.org/10.1016/j.apenergy.2019.04.173]

3. Hao, Y.; Niu, X.; Wang, J. Impacts of Haze Pollution on China’s Tourism Industry: A System of Economic Loss Analysis. J. Environ. Manag.; 2021; 295, 113051. [DOI: https://dx.doi.org/10.1016/j.jenvman.2021.113051] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/34182342]

4. Kim, K.H.; Kabir, E.; Kabir, S. A Review on the Human Health Impact of Airborne Particulate Matter. Environ. Int.; 2015; 74, pp. 136-143. [DOI: https://dx.doi.org/10.1016/j.envint.2014.10.005] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/25454230]

5. Maji, K.J.; Ye, W.F.; Arora, M.; Shiva Nagendra, S.M. PM_2.5-Related Health and Economic Loss Assessment for 338 Chinese Cities. Environ. Int.; 2018; 121, pp. 392-403. [DOI: https://dx.doi.org/10.1016/j.envint.2018.09.024] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/30245362]

6. Zhang, Y.; Bocquet, M.; Mallet, V.; Seigneur, C.; Baklanov, A. Real-Time Air Quality Forecasting, Part I: History, Techniques, and Current Status. Atmos. Environ.; 2012; 60, pp. 632-655. [DOI: https://dx.doi.org/10.1016/j.atmosenv.2012.06.031]

7. Wang, J.; Wang, R.; Li, Z. A Combined Forecasting System Based on Multi-Objective Optimization and Feature Extraction Strategy for Hourly PM_2.5 Concentration. Appl. Soft Comput.; 2022; 114, 108034. [DOI: https://dx.doi.org/10.1016/j.asoc.2021.108034]

8. Djalalova, I.; Delle Monache, L.; Wilczak, J. PM_2.5 Analog Forecast and Kalman Filter Post-Processing for the Community Multiscale Air Quality (CMAQ) Model. Atmos. Environ.; 2015; 108, pp. 76-87. [DOI: https://dx.doi.org/10.1016/j.atmosenv.2015.02.021]

9. Baker, K.R.; Woody, M.C.; Valin, L.; Szykman, J.; Yates, E.L.; Iraci, L.T.; Choi, H.D.; Soja, A.J.; Koplitz, S.N.; Zhou, L. et al. Photochemical Model Evaluation of 2013 California Wild Fire Air Quality Impacts Using Surface, Aircraft, and Satellite Data. Sci. Total Environ.; 2018; 637–638, pp. 1137-1149. [DOI: https://dx.doi.org/10.1016/j.scitotenv.2018.05.048] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/29801207]

10. Zhang, Q.; Xue, D.; Liu, X.; Gong, X.; Gao, H. Process Analysis of PM_2.5 Pollution Events in a Coastal City of China Using CMAQ. J. Environ. Sci.; 2019; 79, pp. 225-238. [DOI: https://dx.doi.org/10.1016/j.jes.2018.09.007]

11. Lee, K.; Yu, J.; Lee, S.; Park, M.; Hong, H.; Park, S.Y.; Choi, M.; Kim, J.; Kim, Y.; Woo, J.H. et al. Development of Korean Air Quality Prediction System Version 1 (KAQPS v1) with Focuses on Practical Issues. Geosci. Model Dev.; 2020; 13, pp. 1055-1073. [DOI: https://dx.doi.org/10.5194/gmd-13-1055-2020]

12. Ryu, Y.H.; Hodzic, A.; Barre, J.; Descombes, G.; Minnis, P. Quantifying Errors in Surface Ozone Predictions Associated with Clouds over the CONUS: A WRF-Chem Modeling Study Using Satellite Cloud Retrievals. Atmos. Chem. Phys.; 2018; 18, pp. 7509-7525. [DOI: https://dx.doi.org/10.5194/acp-18-7509-2018]

13. Cheng, X.; Liu, Y.; Xu, X.; You, W.; Zang, Z.; Gao, L.; Chen, Y.; Su, D.; Yan, P. Lidar Data Assimilation Method Based on CRTM and WRF-Chem Models and Its Application in PM_2.5 Forecasts in Beijing. Sci. Total Environ.; 2019; 682, pp. 541-552. [DOI: https://dx.doi.org/10.1016/j.scitotenv.2019.05.186] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/31129542]

14. Abdi-Oskouei, M.; Carmichael, G.; Christiansen, M.; Ferrada, G.; Roozitalab, B.; Sobhani, N.; Wade, K.; Czarnetzki, A.; Pierce, R.B.; Wagner, T. et al. Sensitivity of Meteorological Skill to Selection of WRF-Chem Physical Parameterizations and Impact on Ozone Prediction During the Lake Michigan Ozone Study (LMOS). J. Geophys. Res. Atmos.; 2020; 125, e2019JD031971. [DOI: https://dx.doi.org/10.1029/2019JD031971]

15. Lopez-Restrepo, S.; Yarce, A.; Pinel, N.; Quintero, O.L.; Segers, A.; Heemink, A.W. Forecasting PM₁₀ and PM_2.5 in the Aburrá Valley (Medellín, Colombia) via EnKF Based Data Assimilation. Atmos. Environ.; 2020; 232, 117507. [DOI: https://dx.doi.org/10.1016/j.atmosenv.2020.117507]

16. Wei, W.; Lv, Z.F.; Li, Y.; Wang, L.T.; Cheng, S.; Liu, H. A WRF-Chem Model Study of the Impact of VOCs Emission of a Huge Petro-Chemical Industrial Zone on the Summertime Ozone in Beijing, China. Atmos. Environ.; 2018; 175, pp. 44-53. [DOI: https://dx.doi.org/10.1016/j.atmosenv.2017.11.058]

17. Chen, Q.; Taylor, D. Transboundary Atmospheric Pollution in Southeast Asia: Current Methods, Limitations and Future Developments. Crit. Rev. Environ. Sci. Technol.; 2018; 48, pp. 997-1029. [DOI: https://dx.doi.org/10.1080/10643389.2018.1493337]

18. De Mattos Neto, P.S.G.; Madeiro, F.; Ferreira, T.A.E.; Cavalcanti, G.D.C. Hybrid Intelligent System for Air Quality Forecasting Using Phase Adjustment. Eng. Appl. Artif. Intell.; 2014; 32, pp. 185-191. [DOI: https://dx.doi.org/10.1016/j.engappai.2014.03.010]

19. Baptista, M.; Sankararaman, S.; de Medeiros, I.P.; Nascimento, C.; Prendinger, H.; Henriques, E.M.P. Forecasting Fault Events for Predictive Maintenance Using Data-Driven Techniques and ARMA Modeling. Comput. Ind. Eng.; 2018; 115, pp. 41-53. [DOI: https://dx.doi.org/10.1016/j.cie.2017.10.033]

20. Wang, J.; Lei, C.; Guo, M. Daily Natural Gas Price Forecasting by a Weighted Hybrid Data-Driven Model. J. Pet. Sci. Eng.; 2020; 192, 107240. [DOI: https://dx.doi.org/10.1016/j.petrol.2020.107240]

21. Aladağ, E. Forecasting of Particulate Matter with a Hybrid ARIMA Model Based on Wavelet Transformation and Seasonal Adjustment. Urban Clim.; 2021; 39, 100930. [DOI: https://dx.doi.org/10.1016/j.uclim.2021.100930]

22. Bhatti, U.A.; Yan, Y.; Zhou, M.; Ali, S.; Hussain, A.; Qingsong, H.; Yu, Z.; Yuan, L. Time Series Analysis and Forecasting of Air Pollution Particulate Matter (PM_2.5): An SARIMA and Factor Analysis Approach. IEEE Access; 2021; 9, pp. 41019-41031. [DOI: https://dx.doi.org/10.1109/ACCESS.2021.3060744]

23. Shaziayani, W.N.; Ul-Saufie, A.Z.; Ahmat, H.; Al-Jumeily, D. Coupling of Quantile Regression into Boosted Regression Trees (BRT) Technique in Forecasting Emission Model of PM₁₀ Concentration. Air Qual. Atmos. Health; 2021; 14, pp. 1647-1663. [DOI: https://dx.doi.org/10.1007/s11869-021-01045-3]

24. Liu, T.; Lau, A.K.H.; Sandbrink, K.; Fung, J.C.H. Time Series Forecasting of Air Quality Based On Regional Numerical Modeling in Hong Kong. J. Geophys. Res. Atmos.; 2018; 123, pp. 4175-4196. [DOI: https://dx.doi.org/10.1002/2017JD028052]

25. Abdullah, S.; Napi, N.N.L.M.; Ahmed, A.N.; Mansor, W.N.W.; Mansor, A.A.; Ismail, M.; Abdullah, A.M.; Ramly, Z.T.A. Development of Multiple Linear Regression for Particulate Matter (PM₁₀) Forecasting during Episodic Transboundary Haze Event in Malaysia. Atmosphere; 2020; 11, 289. [DOI: https://dx.doi.org/10.3390/atmos11030289]

26. Mohd Napi, N.N.L.; Noor Mohamed, M.S.; Abdullah, S.; Mansor, A.A.; Ahmed, A.N.; Ismail, M. Multiple Linear Regression (MLR) and Principal Component Regression (PCR) for Ozone (O₃) Concentrations Prediction. IOP Conf. Ser. Earth Environ. Sci.; 2020; 616, 012004. [DOI: https://dx.doi.org/10.1088/1755-1315/616/1/012004]

27. Bai, Y.; Li, Y.; Wang, X.; Xie, J.; Li, C. Air Pollutants Concentrations Forecasting Using Back Propagation Neural Network Based on Wavelet Decomposition with Meteorological Conditions. Atmos. Pollut. Res.; 2016; 7, pp. 557-566. [DOI: https://dx.doi.org/10.1016/j.apr.2016.01.004]

28. Li, X.; Peng, L.; Yao, X.; Cui, S.; Hu, Y.; You, C.; Chi, T. Long Short-Term Memory Neural Network for Air Pollutant Concentration Predictions: Method Development and Evaluation. Environ. Pollut.; 2017; 231, pp. 997-1004. [DOI: https://dx.doi.org/10.1016/j.envpol.2017.08.114]

29. Jiang, X.; Wei, P.; Luo, Y.; Li, Y. Air Pollutant Concentration Prediction Based on a CEEMDAN-FE-BiLSTM Model. Atmosphere; 2021; 12, 1452. [DOI: https://dx.doi.org/10.3390/atmos12111452]

30. Weihong, W.; Shuangshuang, N. The Performance of Several Combining Forecasts for Stock Index. Proceedings of the 2008 International Seminar on Future Information Technology and Management Engineering; Leicestershire, UK, 20 November 2008; pp. 450-455. [DOI: https://dx.doi.org/10.1109/FITME.2008.42]

31. Wang, J.; Li, J.; Li, Z. Prediction of Air Pollution Interval Based on Data Preprocessing and Multi-Objective Dragonfly Optimization Algorithm. Front. Ecol. Evol.; 2022; 10, 855606. [DOI: https://dx.doi.org/10.3389/fevo.2022.855606]

32. Yang, W.; Sun, S.; Hao, Y.; Wang, S. A Novel Machine Learning-Based Electricity Price Forecasting Model Based on Optimal Model Selection Strategy. Energy; 2022; 238, 121989. [DOI: https://dx.doi.org/10.1016/j.energy.2021.121989]

33. Murillo-Escobar, J.; Sepulveda-Suescun, J.P.; Correa, M.A.; Orrego-Metaute, D. Forecasting Concentrations of Air Pollutants Using Support Vector Regression Improved with Particle Swarm Optimization: Case Study in Aburrá Valley, Colombia. Urban Clim.; 2019; 29, 100473. [DOI: https://dx.doi.org/10.1016/j.uclim.2019.100473]

34. Wu, Q.; Lin, H. A Novel Optimal-Hybrid Model for Daily Air Quality Index Prediction Considering Air Pollutant Factors. Sci. Total Environ.; 2019; 683, pp. 808-821. [DOI: https://dx.doi.org/10.1016/j.scitotenv.2019.05.288] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/31154159]

35. Li, G.; Chen, L.; Yang, H. Prediction of PM_2.5 Concentration Based on Improved Secondary Decomposition and CSA-KELM. Atmos. Pollut. Res.; 2022; 13, 101455. [DOI: https://dx.doi.org/10.1016/j.apr.2022.101455]

36. Gan, R.; Guo, Q.; Chang, H.; Yi, Y. Improved Ant Colony Optimization Algorithm for the Traveling Salesman Problems. J. Syst. Eng. Electron.; 2010; 21, pp. 329-333. [DOI: https://dx.doi.org/10.3969/j.issn.1004-4132.2010.02.025]

37. Dhiman, G.; Kumar, V. Seagull Optimization Algorithm: Theory and Its Applications for Large-Scale Industrial Engineering Problems. Knowledge-Based Syst.; 2019; 165, pp. 169-196. [DOI: https://dx.doi.org/10.1016/j.knosys.2018.11.024]

38. Zhou, Q.; Lv, Q.; Zhang, G. A Combined Forecasting System Based on Modified Multi-Objective Optimization for Short-Term Wind Speed and Wind Power Forecasting. Appl. Sci.; 2021; 11, 9383. [DOI: https://dx.doi.org/10.3390/app11209383]

39. Zhou, Y.; Wang, J.; Li, Z.; Lu, H. Short-Term Photovoltaic Power Forecasting Based on Signal Decomposition and Machine Learning Optimization. Energy Convers. Manag.; 2022; 267, 115944. [DOI: https://dx.doi.org/10.1016/j.enconman.2022.115944]

40. Wang, J.; Gao, J.; Wei, D. Electric Load Prediction Based on a Novel Combined Interval Forecasting System. Appl. Energy; 2022; 322, 119420. [DOI: https://dx.doi.org/10.1016/j.apenergy.2022.119420]

41. Wu, Z.; Zhang, S. Study on the Spatial–Temporal Change Characteristics and Influence Factors of Fog and Haze Pollution Based on GAM. Neural Comput. Appl.; 2019; 31, pp. 1619-1631. [DOI: https://dx.doi.org/10.1007/s00521-018-3532-z]

42. Zakaria, N.N.; Othman, M.; Sokkalingam, R.; Daud, H.; Abdullah, L.; Kadir, E.A. Markov Chain Model Development for Forecasting Air Pollution Index of Miri, Sarawak. Sustainability; 2019; 11, 5190. [DOI: https://dx.doi.org/10.3390/su11195190]

43. Zhou, W.; Wu, X.; Ding, S.; Cheng, Y. Predictive Analysis of the Air Quality Indicators in the Yangtze River Delta in China: An Application of a Novel Seasonal Grey Model. Sci. Total Environ.; 2020; 748, 141428. [DOI: https://dx.doi.org/10.1016/j.scitotenv.2020.141428]

44. Kim, J.; Wang, X.; Kang, C.; Yu, J.; Li, P. Forecasting Air Pollutant Concentration Using a Novel Spatiotemporal Deep Learning Model Based on Clustering, Feature Selection and Empirical Wavelet Transform. Sci. Total Environ.; 2021; 801, 149654. [DOI: https://dx.doi.org/10.1016/j.scitotenv.2021.149654] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/34416605]

45. Dai, H.; Huang, G.; Zeng, H.; Zhou, F. PM_2.5 Volatility Prediction by XGBoost-MLP Based on GARCH Models. J. Clean. Prod.; 2022; 356, 131898. [DOI: https://dx.doi.org/10.1016/j.jclepro.2022.131898]

46. Liu, B.; Yu, X.; Chen, J.; Wang, Q. Air Pollution Concentration Forecasting Based on Wavelet Transform and Combined Weighting Forecasting Model. Atmos. Pollut. Res.; 2021; 12, 101144. [DOI: https://dx.doi.org/10.1016/j.apr.2021.101144]

47. Sayeed, A.; Choi, Y.; Eslami, E.; Lops, Y. Using a Deep Convolutional Neural Network to Predict 2017 Ozone Concentrations, 24 Hours in Advance. Neural Netw.; 2019; 121, pp. 396-408. [DOI: https://dx.doi.org/10.1016/j.neunet.2019.09.033] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/31604202]

48. Mo, Y.; Li, Q.; Karimian, H.; Fang, S.; Tang, B.; Chen, G. A Novel Framework for Daily Forecasting of Ozone Mass Concentrations Based on Cycle Reservoir with Regular Jumps Neural Networks. Atmos. Environ.; 2020; 220, 117072. [DOI: https://dx.doi.org/10.1016/j.atmosenv.2019.117072]

49. Chen, S.; Wang, J.; Zhang, H. A Hybrid PSO-SVM Model Based on Clustering Algorithm for Short-Term Atmospheric Pollutant Concentration Forecasting. Technol. Forecast. Soc. Chang.; 2019; 146, pp. 41-54. [DOI: https://dx.doi.org/10.1016/j.techfore.2019.05.015]

50. Huang, G.; Li, X.; Zhang, B.; Ren, J. PM_2.5 Concentration Forecasting at Surface Monitoring Sites Using GRU Neural Network Based on Empirical Mode Decomposition. Sci. Total Environ.; 2021; 768, 144516. [DOI: https://dx.doi.org/10.1016/j.scitotenv.2020.144516] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/33453525]

51. Wang, B.; Jiang, Q.; Jiang, P. A Combined Forecasting Structure Based on the L 1 Norm: Application to The. J. Environ. Manag.; 2019; 246, pp. 299-313. [DOI: https://dx.doi.org/10.1016/j.jenvman.2019.05.124]

52. Saremi, S.; Mirjalili, S.; Lewis, A. Grasshopper Optimisation Algorithm: Theory and Application. Adv. Eng. Softw.; 2017; 105, pp. 30-47. [DOI: https://dx.doi.org/10.1016/j.advengsoft.2017.01.004]

53. Diebold, F.X.; Mariano, R.S. Comparing Predictive Accuracy. J. Bus. Econ. Stat.; 1995; 13, pp. 253-263. [DOI: https://dx.doi.org/10.1080/07350015.1995.10524599]

54. Liu, S.; Lin, Y. Introduction to Grey Systems Theory. Understanding Complex Systems; Springer: Berlin/Heidelberg, Germany, 2010; Volume 68, pp. 1-399. [DOI: https://dx.doi.org/10.1007/978-3-642-16158-2_1]

Word count: 10257

Show less

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Translate

With the continuous expansion of the industrial production scale and the rapid promotion of urbanization, more and more serious air pollution threatens people’s lives and social development. To reduce the losses caused by polluted weather, it is popular to predict the concentration of pollutants timely and accurately, which is also a research hotspot and challenging issue in the field of systems engineering. However, most studies only pursue the improvement of prediction accuracy, ignoring the function of robustness. To make up for this defect, a novel air pollutant concentration prediction (APCP) system is proposed for environmental system management, which is constructed by four modules, including time series reconstruction, submodel simulation, weight search, and integration. It not only realizes the filtering and reconstruction of redundant series based on the decomposition-ensemble mode, but also the weight search mechanism is designed to trade off precision and stability. Taking the hourly concentration of PM_2.5 in Guangzhou, Shanghai, and Chengdu, China as an example, the simulation results show that the APCP system has perfect prediction capacity and superior stability performance, which can be used as an effective tool to guide early warning decision-making in the management of environmental engineering.

Details

Title

A Novel Air Pollutant Concentration Prediction System Based on Decomposition-Ensemble Mode and Multi-Objective Optimization for Environmental System Management

Author

Yan, Hao¹; Zhou, Yilin²; Gao, Jialu²; Wang, Jianzhou³

¹ Business School, Shandong Normal University, Jinan 250014, China
² School of Statistics, Dongbei University of Finance and Economics, Dalian 116025, China
³ Institute of Systems Engineering, Macau University of Science and Technology, Macao 999078, China

First page

139

Publication year

2022

Publication date

2022

Publisher

MDPI AG

e-ISSN

20798954

Source type

Scholarly Journal

Language of publication

English

DOI

https://doi.org/10.3390/systems10050139

ProQuest document ID

2728532038

A Novel Air Pollutant Concentration Prediction System Based on Decomposition-Ensemble Mode and Multi-Objective Optimization for Environmental System Management

Jump to:

Full text

Abstract

Details

Suggested sources