1. Introduction
Time series prediction has been a classic machine learning task widely studied in recent years, and it has been applied to many fields, such as business [1], engineering [2], energy [3], and so on. It is hoped that some prediction methods can be used to find the internal association rules of the time series to predict the future changes.
Popular time series prediction methods include classical time series models, such as linear regression, autoregression, and moving average [4,5], as well as artificial intelligence methods, such as support vector regression (SVR) [6], artificial neural networks (ANNs) [7], nonlinear autoregressive (NAR) [8], recurrent neural networks (RNNs) [9,10], long short-term memory (LSTM) networks [11,12], and so on. The classical time series models usually require the data to be analyzed to meet the ergodicity, stationarity, and other assumptions [13], and these strict conditions may not always hold for real world data. Although many intelligent methods such as SVR and ANNs come with the ability to capture the nonlinear relationship between the input and output variables, and have been applied to solve many real-word problems [14,15,16,17], the quality of their prediction depends very much on the data set to be processed [18]. These numerical prediction methods are computationally expensive and more suitable for a short-term prediction. In addition, the lack of interpretability of these numerical methods has become the main barrier in their wide acceptance in time series prediction applications. These methods, as well as their results, are not easy to be understood by humans.
The fuzzy information granulation of the time series [19,20] is one of the feasible methods to solve the above shortcomings. In this method, a time series is firstly divided into some meaningful time windows in the time dimension, and then a fuzzy information granule (FIG) is established on each time window. In this way, a time series is transformed to a sequence of FIGs, each of which is a fuzzy set, and the time series can be now analyzed at the granular level by directly manipulating the FIGs. The combination of an FIG with fuzzy reasoning [21], a support vector machine [22], and other artificial intelligence algorithms for data mining [23,24,25] has become one of the research hotspots in recent years.
An FIG is usually represented by a fuzzy set, whose membership function is defined on the real number axis. Common forms of the membership functions of FIGs include the interval membership function, triangular membership function, and Gaussian membership function [26,27,28], and the corresponding FIGs are called Interval FIG(IFIG), Triangular FIG(TFIG), and Gaussian FIG(GFIG) in this paper, respectively. These types of FIGs can reflect the characteristics of the average and range of the temporal data in a time window, but fail to reflect the trend characteristic of the data.
As a modification, the linear fuzzy information granule (LFIG) [29] is represented by a fuzzy set whose membership function is defined both on the real number axis and the time axis. Like the LFIG, the polar fuzzy information granule (polar FIG) [30] defines its membership function in two-dimensional polar coordinates. Compared with the FIG, the added dimension in the membership functions of the LFIG and the polar FIG makes them possible to express the linear changing trends of the samples in the time window. However, both modifications failed to reflect the nonlinear trends of the samples.
In [31], a novel Gaussian-type time-variant FIG, named the generalized zonary time-variant FIG (GZTFIG), was proposed which can reflect the nonlinear trend of data changing. However, the form of the GZTFIG is a bit complex. In addition, GZTFIGs have a lack of specificity [32], that is, their semantics are not so clear.
To solve this problem, based on [29,31], this paper introduces a novel FIG, namely, the polynomial fuzzy information granule (PFIG), which represents the temporal samples in a time window by three kinds of parameters: (1) the length of the time window, (2) a polynomial center line, and (3) the degree of data deviation from the center line. Temporal samples can be reasonably characterized by the polynomial center line with an adjustable order. In the sense of the Hausdorff distance, the distance formula of the PFIG is derived theoretically. In particular, we prove that the distance of the two Gaussian PFIGs has a concise formula expression and intuitive geometric interpretation. Therefore, the PFIG is a well-defined information granule with a good distance property, which is hopeful to become an effective tool in time series granulation and prediction.
The remainder of this paper is organized as follows. Section 2 introduces traditional fuzzy information granules and their distance measure. Section 3 introduces PFIGs as well as their distance measure. It can be proved that the distance of Gaussian PFIGs have a simple formula, which corresponds to a reasonable geometric interpretation. Section 4 presents a long-term prediction method for the time series based on the distance measure of PFIGs and fuzzy inference system with an interpolation scheme. Section 5 describes three experiments to verify the effectiveness and feasibility of the proposed model. Finally, Section 6 provides the conclusions and offers some thoughts on future studies.
2. Fuzzy Information Granules and Their Distance
In this section, we briefly introduce the construction method of some common FIGs along with their distance measurement. These concepts are useful for defining the novel type of granule (PFIG) given in Section 3.
2.1. Fuzzy Numbers and Their Distance Measurement
A fuzzy set (class) in is characterized by a membership (characteristic) function which associates with each point in a real number in the interval [0,1] [33]. Fuzzy numbers are special fuzzy sets. A fuzzy set is called a fuzzy number if its membership function satisfies the following conditions [34]:
-
(a). is normal: there exists a number such that ;
-
(b). is convex: ,;
-
(c). is upper semicontinuous: , , if , then .
-
(d). is compactly supported: the closure of the set is compact.
Figure 1 shows examples of one-dimensional and two-dimensional fuzzy numbers.
Fuzzy numbers generalize classical real intervals. Accordingly, the distance of the fuzzy numbers can also be extended from the Hausdorff distance of the real number intervals. Named after Felix Hausdorff, the Hausdorff distance is a generalized metric in the family of closed sets, which measures the maximum distance of a set to the nearest point in the other set [35]. Let be the -level set (or called -cut) of fuzzy number . Since the -level sets and of fuzzy numbers and are classical real intervals, their Hausdorff distance can be written as:
(1)
where represents the distance of real numbers and . Based on such a distance for real sets, the Hausdorff distance of fuzzy numbers and can be defined as [36]:(2)
2.2. Fuzzy Information Granules and Their Distance
Constructing an FIG to represent a group of temporal data should comply with the following two fairly conflicting conditions [32]. The first condition is called representativeness, that is, should embrace enough data in . In another word, the total membership of belonging to , , should be as large as possible, so that has a good data coverage. The second condition is called specificity, that is, should be specific enough. This is accomplished by keeping the support of as compact (small) as possible, so that has a better semantic clarity [32]. According to the types of membership functions, FIGs can be divided into Interval FIGs (IFIGs), Triangular FIGs (TFIGs), Gaussian FIGs (GFIGs), etc. Denote the membership function of an FIG as , where is the parameter set of the membership function of the corresponding type. The optimal value of can be determined by optimizing both the representativeness and the specificity of the FIG through the optimization methods described below [32].
2.2.1. Interval Fuzzy Information Granules and Their Distance
An IFIG, denoted as or , comes with the following membership function:
(3)
Let be the parameter set of this membership function, are the left and right boundary points of the interval-type membership function, which can be determined by the following optimization model:
(4)
The numerator part of Equation (4) is related to the number of data that belong to the IFIG . Maximizing the numerator part can make achieve the best data coverage or representativeness. The denominator part of Equation (4) is related to the support of . Minimizing the denominator part can make achieve the best specificity.
As far as the distance of two IFIGs is concerned, according to Equations (1) and (2), the distance of and can be written as:
(5)
2.2.2. Triangular Fuzzy Information Granules and Their Distance
A TFIG, denoted as or , comes with the following membership function:
(6)
Let be the parameter set of , where and are called the “left extreme point”, the “normal point”, and the “right extreme point”, respectively. can be determined as the median of , and the extreme points and can be determined by the following optimization model [37]:
(7)
The numerator part of Equation (7) is related to the total membership degrees of belonging to , which reflects the representativeness of to . Maximizing the numerator part can make achieve the best representativeness. The denominator part of Equation (7) is related to the support of . Minimizing the denominator part can make achieve the best specificity.
As far as the distance of two TFIGs is concerned, according to Equations (1) and (2), the distance of and can be written as:
(8)
2.2.3. Gaussian Fuzzy Information Granules and Their Distance
A GFIG, denoted as or , comes with the following membership function:
(9)
Let be the parameter set of , where and can be estimated by the mean value and standard deviation of , respectively. The GFIG has a clear meaning, and compared with the TFIG, the GFIG has less parameters with simpler parameter determination methods. According to the extension principle [38], the linear operation of GFIGs has a concise calculation property. That is:
([29]). For any two GFIGs and any two real numbers , we have
(10)
Furthermore, the Hausdorff distance of two GFIGs and can be concisely written as [29]:
(11)
where and .3. Polynomial Fuzzy Information Granules and Their Distance Metric
3.1. Polynomial Fuzzy Information Granules
Suppose we are going to granulize a time series as shown in Figure 2a. The IFIG, TFIG, and GFIG obtained by the corresponding granulation methods given by Equations (4), (7), and (9) in Section 2 are shown in Figure 2b–d, respectively. Because the membership functions of the IFIG, TFIG, and GFIG (as shown in Equations (3), (6), and (9)) are time-independent, their membership values of depend only on . As a result, these FIGs can properly reflect the average and range information of . However, they fail to reflect the changing trends of the time series, since the time-varying relationship between and is ignored.
One way to remedy this defect is to add the time variable to the membership functions of the FIGs. The two-dimensional membership function of an FIG can be obtained by shifting the one-dimensional membership function in Figure 2 along the time axis. Follow this mechanism, the LFIGs [29] accomplish this goal by making the membership function of the GFIG (as shown in Figure 2d) change linearly over time. The two-dimensional membership function of the LFIG is shown in Figure 3.
Motivated by the time variant LFIG, we define a novel class of FIG called polynomial fuzzy information granules (PFIG) in this paper. In order to fully reflect the time-varying characteristics of the data in the time window, a reasonable PFIG should contain three types of parameters:
-
The length of the time series , i.e., the size of time window, ;
-
A time-variant -order center (regression) curve line that reflects the changing of the time series;
-
The parameters reflecting the deviation degree of the temporal data from the center line.
(PFIG) For a time series , a fuzzy information granule representing is called a PFIG of order , if its membership function has a center curve line:
(12)
and can be written as:(13)
where is a fuzzy number, is its membership function, is a parameter set of the PFIG that reflects the deviation degree of the data from the center line, and the length of the time series is called the granularity of the PFIG.Recently, Luo proposed a novel generalized zonary time-variant FIG (GZTFIG) in [31], which is able to reflect the nonlinear trends of the time series. The membership function of a GZTFIG is defined as:
where reflects the data fluctuation interval in the current time window. can be determined by making all the data in locate in the zonary area between the upper boundary , and the lower boundary .Different from the GZTFIGs proposed in [31], the proposed PFIG no longer restricts its membership function to be Gaussian. Users can choose any appropriate type of membership function to construct a PFIG according to the characteristics of the time series. For the following three reasons, the PFIG uses the center curve line, instead of the zonal central region in the GZTFIG to reflect the changing trends of a time series. First, the width of the zonal region is to reflect the degree of data deviation from the center line. However, this feature can also be reflected by the parameter in the PFIG, which is more concise. Secondarily, compared with GZTFIGs, the process of determining the zonary area is to avoid being in the PFIG. Thus, the construction of the PFIG is simpler. Last but not the least, the central zonary area of a GZTFIG is determined by translating the central line upward and downward, so that all the data are included in the zonary area. This operation increases the representativeness of the GZTFIG but decreases its specificity.
Using the definition of PFIGs, we can give the definitions for three special kinds of PFIGs, namely interval-type PFIGs (IPFIGs), triangular-type PFIGs (TPFIGs), and Gaussian-type PFIGs (GPFIGs).
(IPFIG). A PFIG is called an IPFIG of order , if its membership function can be written as:
where and are the left and right boundary points of the interval-type membership function at , respectively; represents the center curve line of PFIG; and is the radius of this interval.(TPFIG): A PFIG is called a TPFIG of order if its membership function can be written as:
where represents the center curve line of the PFIG, and and are the left and right extreme points of the triangular membership function at , respectively.(GPFIG): A PFIG is called a GPFIG of order if its membership function can be written as:
(14)
where is the center line of PFIG, and is the standard deviation that reflects the degree of data deviation from the center line.The construction of a GPFIG is straightforward. The center line can be estimated by the polynomial regression of the temporal data , and can be estimated by:
(15)
According to the above definitions, the temporal data in Figure 2a can be represented by a second order GPFIG as shown in Figure 4. Compared with the IFIG, TFIG, and GFIG shown in Figure 2b–d, the GPFIG can better reflect the changing trends in the temporal data, as shown in Figure 2a.
When the order of a PFIG is 0, the PFIG degenerates into the same type of FIGs introduced in Section 1, and when the order of a GPFIG is 1, the center line of the GPFIG becomes a linear function . Accordingly, the GPFIG degenerates into the LFIG proposed by [29]. Therefore, PFIGs, including GPFIGs, are a generalization of LFIGs.
A GPFIG is an information granule with good properties. From its definition, it can be found that the number of parameters of a GPFIG is small, its meaning is clear, and the Gaussian-type FIG has excellent operation properties, as shown in Theorem 1. In addition, the GPFIG is also easy to understand, given a GPFIG , we can imagine a time series of length , distributed around a center curve line , with as the deviation degree of the temporal data from the center line. Therefore, the GPFIG is a good tool to construct the information granule to describe a group of temporal data. For this reason, our experiments in Section 5 will focus only on the GPFIG.
3.2. Distance Metric of Polynomial Fuzzy Information Granules
By comparing Figure 4 and Figure 1b, we can find that the PFIG is a special type of two-dimensional fuzzy set. For a PFIG , its membership function is a two-dimensional convex surface. As shown in Figure 5, the -level set of is:
(16)
which can be regarded as a two-dimensional closed interval in the time-number plane (- plane).Similar to Equation (2), define the distance of two PFIGs and as the sum of all the distances of the -level sets [36]:
(17)
where is the Hausdorff distance of two-dimensional closed intervals and , which can be written as:(18)
where .The calculation of Equation (18) is very complex and is computationally difficult to implement. To solve this problem, we should redefine based on the following considerations. The Hausdorff distance is mainly used for the distance measurement of the general multidimensional real intervals, where every dimension is usually independent of the other dimensions. However, since the temporal data are special two-dimensional data, where can be regarded as a function of the time , these temporal data should have a distance definition different from Equation (18).
As shown in Figure 6, cut the -level set at , we get a cutting segment :
(19)
which is a one-dimensional closed interval.According to Equation (1), the Hausdorff distance of two cutting segments and at time is:
(20)
Then, the distance between and can be regarded as the sum of the distances of all these cutting segment pairs at all , that is:
(21)
In this way, the distance of two PFIGs can be found by substituting Equation (21) to Equation (17). The distance of any different type of PFIGs can be calculated when the corresponding membership function in Equation (13) is selected. In particular, the distance of two Gaussian PFIGs have the following expression.
(Distance of GPFIGs): the Hausdorff distance of two GPFIGs, namely and , can be written as the area between their center lines and , and a term proportional to the difference of and , that is:
(22)
where is the center line of the GPFIG.According to Equation (14), the -level set of GPFIG can be written as:
Substitute it into Equation (17) and the distance between and can be written as:
□Obviously, the first term of Equation (22) is the integral of the distance difference between two center lines and in , which represents the distance caused by the center lines. As illustrated in Figure 7, the geometric meaning of this term is the area between the two center lines. The second term is times of , which represents the distance caused by data deviations. Therefore, the distance between two GPFIGs is determined by their center lines on the one hand, and the data deviations on the other. This is consistent with our intuition.
When PFIGs degenerate into LFIGs, the distance given by Equation (22) is consistent with the distance of LFIGs given in [29], i.e.,
where , and are the center lines of the two LFIGs, respectively. , , and represents the intersection of the two center lines. is the difference of two data deviations.4. Granular Time Series Prediction Method Based on Fuzzy Inference
4.1. Granule Based Fuzzy Inference
Given a time series , divide it into consecutive time windows, and then construct an FIG on each time window, and we can obtain a granular time series . Such a granular time series can induce a fuzzy inference system (FIS) by the following way.
Select consecutive FIGs: , and take the first FIGs as the antecedents of a fuzzy rule, and the last one as the consequent, then these FIGs imply a -input 1-output fuzzy rule:
A time series containing FIGs can form a total of rules. These rules constitute the rule base of the FIS.
As shown in Figure 8, when we use the last FIGs of this granular time series as the inputs, then the output of the FIS, namely , can be used to predicted the future values of the original time series. Similar to the process of a Mamdani-type FIS, can be written as a linear combination of all the consequents of the rules, that is:
(23)
where is the weight of rule , which can be measured by the Hausdorff distance between the FIS’ inputs , and the antecedents of the -th rule, . That is, can be written as:Normalize these weights, then can be computed from the granular time series by Equation (23).
After obtaining a future FIG (data) , it is incorporated into the original time series to form a new time series to predict the next FIG (data) . Through this closed loop forecasting process, we can predict the values of any length in the future by using the previous predictions as the input. Closed loop forecasting allows us to predict an arbitrary number of time steps, but this can easily lead to large errors because the previous predictions are not the true values during the forecasting process, and the forecasting errors from a previous process can cumulatively affect the subsequent forecasting processes.
4.2. Flow Chart of the Proposed Algorithm
The pseudo-code of the time series prediction algorithm combining FIS and GPFIGs is given in Algorithm 1 as follows:
Algorithm 1: Time series prediction algorithm based on FIS and GPFIGs. |
Input: A numerical time series of length ; the granularity (length of time window) and the polynomial order of PFIG. Number of antecedents of the fuzzy rules in the FIS. |
|
5. Experimental Research
To test the effectiveness of the perdition algorithm combining an FIS and GPFIGs (GPFIG-FIS), the forecasting results of the proposed method are compared with eight competing models in this section.
5.1. Data Description and Experimental Scheme
Three kinds of time series with a pseudo periodicity are selected for the experiment in this section. These data include: (1) the time series of the daily minimum temperature in Melbourne from 1981 to 1990 (this time series is from:
Two kinds of indexes, the root mean-square error (RMSE) and symmetric mean absolute percentage error (SMAPE), are used to measure the effect of the time series prediction methods:
Root mean-square error:
Symmetric mean absolute percentage error:
The following eight prediction methods are selected for a comparison with the GPFIG-FIS method proposed in this paper, among which the first four methods are numerical prediction methods, and the last four methods are methods based on ordinary FIGs and FIS:
AR(): Numerical -order auto regressive model (auto regressive, AR):
NAR(): the -order NAR is a -input 1-output feedforward network, whose input is a vector consisting of data before time in the given training sequence, and output is the datum in time , . The NAR uses a sigmoid transfer function in the hidden layer and a linear transfer function in the output layer. The number of hidden neurons as well as the number of hidden layers is set to 10.
SVR(): numerical -order support vector machine regress model (support vector regress, SVR) [39]:
-
LSTM: a sequence-to-sequence regression LSTM network, where the responses are the training sequences with values shifted by one time step. The LSTM updates the cell and hidden states using the hyperbolic tangent function and uses the sigmoid function as the gate activation function. The number of hidden units is set to 128.
-
IFIG-FIS: the FIS prediction method based on the IFIG. Equations (4) and (5) are used to granulate the time series, and the FIS with -input is used for the prediction.
-
TFIG-FIS: the FIS prediction method based on the TFIG. Equations (7) and (8) are used to granulate the time series, and the FIS with -input is used for the prediction.
-
GFIG-FIS: the FIS prediction method based on the GFIG. The average and standard deviation of the corresponding window data are used as the parameters in constructing the GFIG, and the FIS with -input is used for the prediction.
-
LIFG-FIS: the FIS prediction method based on the LFIG. The linear regression line and the estimate of the error variance are used as the parameters in constructing the LIFG, and the FIS with -input is used for the prediction.
5.2. The Experimental Results
5.2.1. Daily Minimum Temperature Dataset
The time series of daily maximum temperature in Melbourne City from 1981 to 1990 is shown in Figure 9. The time series contains a total of 3650 data, and the first 2928 data are selected as the training samples to predict the daily maximum temperature for the next 183, 366, and 549 days (i.e., days after 2928).
For each prediction method, the number of FIS input is set to three. Since this time series has a natural time cycle, in constructing the granular time series, the size of the time window is set to 183 days, which is about half a year. Additionally, the order of the GPFIG is set as three. Table 1 shows the RMSE and SMAPE indexes by the GPFIG-FIS and other methods for a long-term forecasting.
For brevity, Figure 10 shows the predicted values of several prediction methods with better prediction results. It can be found that compared with the numerical prediction methods, the LFIG-FIS and GPFIG-FIG get better prediction results. Because of the length of the time window, which is about half a year, it has a clear meaning in this experiment, the FIGs constructed by the LFIG-FIS and GPFIG-FIG can reflect the main characteristics of the data in each time window very well, which is beneficial to eliminate the influence of random noises in this time series. When these FIGs are regarded as linguistic variables to construct fuzzy inference rules, a reasonable FIS can be obtained. However, since numerical prediction methods are easily disturbed by noise, they cannot get accurate results for a long-term prediction.
Note that the time series used in this experiment has a clear increasing or decreasing trend in each time window. Because of this, both the LFIG-FIS and GPFIG-FIS that contain trend parameters could accurately reflect these changing trends. Therefore, compared with the other FIG-based prediction methods, the LFIG-FIS and GPFIG-FIS can get better prediction results. Among them, the GPFIG-FIS is slightly better than the LFIG-FIS because it can more accurately reflect the changing trends in each time window.
5.2.2. Power Consumption Time Series
The average power consumption data of the three urban areas every 10 min are collected in the two weeks from 16 to 30 December 2017 in Tetuan, Morocco. As shown in Figure 11, this time series contains 2016 data, and the previous 1920 data are used as the training data set, to predict the last 96 data.
Set the number of FIS input for the various methods. The length of the time window is set as 16, about one ninth of a day, in constructing various kinds of FIGs. Additionally, the polynomial order of the PFIG is set as three. Table 2 shows the RMSE and SMAPE indexes of various methods in predicting the data in the last six time windows (96 points altogether).
For brevity, Figure 12 shows the predicted values of four prediction methods with better prediction results. Figure 12 and Table 2 show that, compared with the numerical prediction methods, the FIG-based prediction methods, namely the IFIG-FIS, TFIG-FIG, LFIG-FIS, and GPFIG-FIG can have more accurate prediction results, among which the LFIG- and GPFIG-based methods can reflect the time-varying trend very well. Therefore, they have better RMSE and SMAPE indexes when dealing with long-term prediction. In predicting the values of different time periods in the future, the GPFIG-FIS always has the best RSME and SMAPE indexes. A reason for this result is that, compared with the previous experiment, the power consumption time series display more complex trend changes in each time window, and the GPFIG with a high-order polynomial center line can better describe these changes. Consequently, the prediction indexes of the GPFIG-FIS are also significantly better than other methods such as the LFIG-FIS.
5.2.3. American Heart Association Electrocardiogram Time Series
The American Heart Association (AHA) ECG datasets collect voltage values at a sampling rate of 250 Hz. A complete ECG waveform contains about 250 sampling points. So, when constructing various FIGs, the length of the time window is set as 25, which is about one-tenth of a whole waveform. Figure 13 plots a time series containing 4250 sampling points. The first 4000 sampling points (including about 16 waveforms) are used as the training set to predict the last 250 sampling points (about 1 waveform).
Set the number of the FIS input for each prediction method. Due to the complexity of the ECG waveform, the polynomial order of the PFIG is set as four. Table 3 shows the RMSE and SMAPE indexes of the various prediction methods.
For brevity, Figure 14 shows the predicted values of several prediction methods with better results. It can be found from Figure 14 and Table 3 that, compared with numerical prediction methods, the prediction methods based on the FIG and FIS have better prediction results.
The changing trends of the ECG time series are more complex than previous experiments. As is shown in Figure 14, this time series is very stable in the first four time windows, and then there are sharp fluctuations from the fifth time window. Correspondingly, in the first four time windows, the RSME and SMAPE indexes of the GPFIG-FIS are almost the same with those of the other FIS-based methods. However, from the fifth time window, the predictive indexes of the GPFIG-FIS are significantly better than the other prediction methods. The main reason for this result is that GPFIG can accurately describe the changing trends in the ECG data through its high-order polynomial center line. In summary, for the ECG time series with complex changing trends, the GPFIG-FIS method can obtain better long-term prediction results.
6. Summary
The PFIGs can accurately describe the key time-varying nonlinear trends of the time series, and thus are a suitable type of FIGs in granulating a time series. A PFIG has three parameters, namely the length of the time window, the centerline of the adjustable order polynomial, and the degree of the data deviation, which have a good interpretability and are easy to understand.
The distance metric of PFIGs is derived theoretically. It shows that the distance of two Gaussian PFIGs can be reasonably interpreted as the sum of the area between their central polynomial lines, and the difference in their data deviation degrees, which has a good geometric meaning.
This paper also designs a fuzzy inference prediction method based on the GPFIG and their distance metric to verify the effectiveness of the proposed GPFIG. The experiments show that for those time series with pseudo periods, the proposed GPFIG-FIS method can achieve better prediction results compared with some numerical prediction methods such as the AR, NAR, SVR and LSTM, and some fuzzy inference methods based on other types of FIG. This conclusion shows that the proposed GPFIG has a good practicability.
The PFIG time series constructed in this paper is composed of several PFIGs of an equal granularity. A question associated with the granulation method in this paper is how to choose the optimal granularity of these PFIGs. Without sufficient periodic knowledge of the time series to be transformed in advance, it may be difficult to determine the best granularity.
A further generalization of the PFIG can improve the ability of the granular time series in representing the complicated time series. How to use this new PFIG to deal with a real-world prediction problem may be a subject of future research.
Conceptualization, X.Y. and F.Y.; methodology, X.Y.; software, X.Y. and S.Z.; validation, X.Z. and S.Z.; formal analysis, X.Y. and S.Z.; investigation, X.Y. and F.Y.; resources, X.Y. and S.Z.; data curation, X.Y.; writing—original draft preparation, X.Y. and S.Z.; writing—review and editing, X.Y. and F.Y.; visualization, X.Y.; supervision, F.Y.; project administration, F.Y.; funding acquisition, X.Y. and F.Y. All authors have read and agreed to the published version of the manuscript.
Not applicable.
The authors would like to acknowledge the support by Fujian Provincial Key Laboratory of Data-Intensive Computing, Fujian University Laboratory of Intelligent Computing and Information Processing, Fujian Provincial Big Data Research Institute of Intelligent Manufacturing.
The authors declare no conflict of interest.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Figure 1. Fuzzy numbers. (a) a one-dimensional fuzzy number; (b) a two-dimensional fuzzy number.
Figure 2. Different types of fuzzy information granules. (a) time series data; (b) interval fuzzy information granule; (c) triangular fuzzy information granule; (d) Gaussian fuzzy information granule.
Figure 6. The cutting segment of the [Forumla omitted. See PDF.]-level set in a PFIG.
Figure 7. The area between the two center lines. (a) The area of two center lines when they have no intersection between [0,τ], (b) the area of two center lines when they have an intersection between [0,τ].
Comparison of the prediction indexes of daily minimum temperature.
Index | Time |
AR | NAR | SVR | LSTM | IFIG-FIS | TFIG-FIS | GFIG-FIS | LFIG-FIS | GPFIG-FIS |
---|---|---|---|---|---|---|---|---|---|---|
RMSE | 2929–3112 | 9.83 | 6.79 | 6.49 | 4.30 | 8.76 | 6.67 | 6.45 | 4.44 | 4.19 |
2929–3295 | 14.76 | 7.01 | 6.54 | 4.25 | 7.57 | 6.39 | 6.18 | 4.27 | 4.14 | |
2929–3478 | 17.23 | 6.83 | 6.4 | 4.25 | 7.86 | 6.31 | 6.11 | 4.19 | 4.04 | |
SMAPE | 2929–3112 | 42.02 | 27.58 | 24.74 | 14.32 | 28.14 | 21.67 | 22.38 | 14.66 | 14.11 |
2929–3295 | 62.34 | 30.63 | 27.23 | 15.51 | 24.48 | 21.27 | 21.97 | 14.71 | 14.48 | |
2929–3478 | 74.18 | 29.9 | 26.73 | 14.87 | 25.85 | 21.74 | 22.47 | 14.66 | 14.28 |
Note: a value in bold font indicates that the corresponding method achieves the best index among all the methods.
Comparison of the prediction indexes of electricity consumption in Tetuan.
Index | Time |
AR | NAR | SVR | LSTM | IFIG-FIS | TFIG-FIS | GFIG-FIS | LFIG-FIS | GPFIG-FIS |
---|---|---|---|---|---|---|---|---|---|---|
RMSE | 1909–1926 | 3479 | 3128 | 5556 | 2934 | 1630 | 1155 | 1033 | 1087 | 692 |
1927–1944 | 2800 | 2749 | 4130 | 2909 | 1647 | 1394 | 1306 | 854 | 565 | |
1945–1962 | 3030 | 3896 | 3415 | 3522 | 1394 | 1146 | 1077 | 718 | 482 | |
1963–1980 | 3803 | 4765 | 3347 | 3343 | 1790 | 1701 | 1676 | 1141 | 691 | |
1981–1998 | 6020 | 6815 | 4504 | 3755 | 1687 | 1603 | 1547 | 1101 | 744 | |
1999–2016 | 6500 | 7273 | 4526 | 3828 | 1823 | 1657 | 1617 | 1065 | 754 | |
SMAPE | 1909–1926 | 19.11 | 16.68 | 32.89 | 16.87 | 8.27 | 5.55 | 5.12 | 5.17 | 3.36 |
1927–1944 | 13.77 | 13.35 | 20.14 | 14.79 | 7.64 | 6.22 | 5.99 | 3.68 | 2.62 | |
1945–1962 | 14.92 | 18.04 | 14.94 | 17.29 | 6.12 | 4.42 | 4.33 | 2.88 | 2.07 | |
1963–1980 | 16.89 | 21.15 | 13.29 | 15.36 | 7.08 | 5.48 | 6.38 | 4.21 | 2.7 | |
1981–1998 | 22.99 | 27.07 | 17.05 | 16.61 | 6.66 | 5.33 | 5.81 | 4.16 | 2.95 | |
1999–2016 | 25.47 | 29.38 | 17.38 | 16.65 | 6.97 | 5.68 | 6.11 | 4.08 | 3.02 |
Note: a value in bold font indicates that the corresponding method achieves the best result among all the methods.
Comparison of the prediction indexes of ECG time series.
Index | Time (4 ms) | AR | NAR | SVR | LSTM | IFIG-FIS | TFIG-FIS | GFIG-FIS | LFIG-FIS | GPFIG-FIS |
---|---|---|---|---|---|---|---|---|---|---|
RMSE | 4001–4025 | 87.91 | 22.74 | 41.06 | 26.83 | 7.33 | 7.43 | 7.7 | 7.2 | 7.96 |
4026–4050 | 99.24 | 40.4 | 48.48 | 22.58 | 8.61 | 10.67 | 13.03 | 12.01 | 11.84 | |
4051–4075 | 102.95 | 52.92 | 51 | 94.60 | 12.23 | 12.34 | 13.82 | 12.93 | 12.1 | |
4076–4100 | 102.58 | 67.47 | 50.23 | 85.65 | 12.68 | 12.85 | 14.2 | 12.27 | 11.31 | |
4101–4125 | 124.51 | 129.95 | 83.39 | 114.43 | 70.03 | 64.81 | 64.07 | 59.72 | 30.74 | |
4126–4150 | 131.99 | 164.89 | 93.48 | 130.28 | 107.32 | 80.48 | 76.25 | 65.06 | 29.18 | |
4151–4175 | 122.86 | 163.78 | 89.79 | 139.07 | 100.47 | 76.09 | 72.11 | 60.88 | 28.41 | |
4176–4200 | 127.1 | 154.01 | 111.27 | 157.61 | 95.37 | 73.1 | 69.92 | 60.01 | 28.8 | |
4201–4225 | 121.25 | 160.38 | 105.42 | 149.67 | 93.44 | 69.98 | 66.71 | 56.75 | 27.4 | |
4226–4250 | 117.79 | 169.45 | 100.35 | 142.36 | 88.9 | 66.46 | 63.34 | 53.9 | 26.14 | |
SMAPE | 4001–4025 | 95.44 | 22.32 | 45.72 | 28.70 | 6.48 | 6.75 | 6.36 | 6.31 | 7.43 |
4026–4050 | 106.77 | 39.06 | 52.28 | 22.25 | 7.59 | 9.55 | 11.5 | 10.69 | 10.63 | |
4051–4075 | 111.88 | 52.56 | 55.11 | 59.09 | 10.77 | 11.42 | 12.75 | 11.87 | 11.2 | |
4076–4100 | 113.48 | 67.23 | 54.62 | 56.81 | 11.84 | 12.38 | 13.52 | 11.25 | 10.38 | |
4101–4125 | 114.54 | 91.74 | 60.64 | 66.26 | 25.84 | 23.55 | 24.93 | 27.62 | 19.71 | |
4126–4150 | 113.47 | 127.12 | 56.93 | 81.95 | 70.16 | 26.25 | 32.58 | 33.95 | 19.31 | |
4151–4175 | 106.49 | 164.82 | 67.18 | 150.81 | 71.01 | 31.95 | 37.2 | 35.64 | 23.8 | |
4176–4200 | 109.68 | 149.17 | 81.82 | 160.37 | 67.17 | 32.85 | 37.63 | 36.3 | 24.15 | |
4201–4225 | 107.33 | 174.87 | 79.45 | 152.83 | 73.07 | 37.05 | 40.11 | 35.21 | 23.45 | |
4226–4250 | 107.68 | 190.28 | 74.94 | 141.83 | 68.52 | 34.43 | 36.93 | 32.6 | 22.05 |
Note: A value in bold font indicates that the corresponding method achieves the best result among all the methods.
References
1. Zhang, H.; Nguyen, H.; Bui, X.-N.; Biswajeet, P.; Mai, N.-L.; Vu, D.A. Proposing two novel hybrid intelligence models for forecasting copper price based on extreme learning machine and meta-heuristic algorithms. Resour. Policy; 2021; 73, 102195. [DOI: https://dx.doi.org/10.1016/j.resourpol.2021.102195]
2. Wang, H.; Luo, C.; Wang, X. Synchronization and identification of nonlinear systems by using a novel self-evolving interval type-2 fuzzy LSTM-neural network. Eng. Appl. Artif. Intell.; 2019; 81, pp. 79-93. [DOI: https://dx.doi.org/10.1016/j.engappai.2019.02.002]
3. Jiang, P.; Yang, H.; Li, H.; Wang, Y. A developed hybrid forecasting system for energy consumption structure forecasting based on fuzzy time series and information granularity. Energy; 2021; 219, 119599. [DOI: https://dx.doi.org/10.1016/j.energy.2020.119599]
4. Box, G.; Jenkins, G.; Reinsel, G. Forecasting and Control; 4th ed. Time Series Analysis; John Wiley & Sons: New York, NY, USA, 2008.
5. Moon, J.; Hossain, M.B.; Chon, K. AR and ARMA model order selection for time-series modeling with ImageNet classification. Signal Process.; 2021; 183, 108026. [DOI: https://dx.doi.org/10.1016/j.sigpro.2021.108026]
6. Xian, H.; Che, J. Unified whale optimization algorithm based multi-kernel SVR ensemble learning for wind speed forecasting. Appl. Soft Comput.; 2022; 130, 109690. [DOI: https://dx.doi.org/10.1016/j.asoc.2022.109690]
7. Yoon, H.; Hyun, Y.; Ha, K.; Lee, K.-K.; Kim, G.-B. A method to improve the stability and accuracy of ANN- and SVM-based time series models for long-term groundwater level predictions. Comput. Geosci.; 2016; 90, pp. 144-155. [DOI: https://dx.doi.org/10.1016/j.cageo.2016.03.002]
8. Sunayana,; Kumar, S.; Kumar, R. Forecasting of municipal solid waste generation using non-linear autoregressive (NAR) neural models. Waste Manag.; 2021; 121, pp. 206-214. [DOI: https://dx.doi.org/10.1016/j.wasman.2020.12.011] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/33360819]
9. Bandara, K.; Bergmeir, C.; Smyl, S. Forecasting across time series databases using recurrent neural networks on groups of similar series: A clustering approach. Expert Syst. Appl.; 2020; 140, 112896. [DOI: https://dx.doi.org/10.1016/j.eswa.2019.112896]
10. Arsov, M.; Zdravevski, E.; Lameski, P.; Corizzo, R.; Koteli, N.; Gramatikov, S.; Mitreski, K.; Trajkovik, V. Multi-Horizon Air Pollution Forecasting with Deep Neural Networks. Sensors; 2021; 21, 1235. [DOI: https://dx.doi.org/10.3390/s21041235]
11. Livieris, I.E.; Pintelas, E.; Pintelas, P. A CNN–LSTM model for gold price time-series forecasting. Neural Comput. Appl.; 2020; 32, pp. 17351-17360. [DOI: https://dx.doi.org/10.1007/s00521-020-04867-x]
12. Jin, X.; Yu, X.; Wang, X.; Bai, Y.; Su, T.; Kong, J. Prediction for Time Series with CNN and LSTM. Proceedings of the 11th International Conference on Modelling, Identification and Control (ICMIC2019); Tianjin, China, 4 December 2019; Wang, R.; Chen, Z.; Zhang, W.; Zhu, Q. Springer: Singapore, 2020; 59.
13. Jilani, T.A.; Burney, S.M.A. M-factor high order fuzzy time series forecasting for road accident data: Analysis and design of intelligent systems using soft computing techniques. Adv. Soft Comput.; 2007; 41, pp. 246-254.
14. Wang, L.; Xue, T.; Wang, H.; Liu, Z. Research on stock index forecasting based on recurrent neural network. J. Zhejiang Univ. Technol.; 2019; 47, pp. 186-191.
15. Wang, X.; Wu, J.; Liu, C.; Yang, H.; Niu, W. Exploring LSTM based recurrent neural network for failure time series prediction. J. Beijing Univ. Aeronaut. Astronaut.; 2018; 44, pp. 772-784.
16. Tang, Y.; Yu, F.; Pedrycz, W.; Yang, X.; Wang, J.; Liu, S. Building trend fuzzy granulation based LSTM recurrent neural network for long-term time series forecasting. IEEE Trans. Fuzzy Syst.; 2022; 30, pp. 1599-1613. [DOI: https://dx.doi.org/10.1109/TFUZZ.2021.3062723]
17. Cheng, R.; Yu, J.; Zhang, M.; Feng, C.; Zhang, W. Short-term hybrid forecasting model of ice storage air-conditioning based on improved SVR. J. Build. Eng.; 2022; 50, 104194. [DOI: https://dx.doi.org/10.1016/j.jobe.2022.104194]
18. Keogh, E.; Kasetty, S. On the need for time series data mining benchmarks: A survey and empirical demonstration. Data Min. Knowl. Discov.; 2002; 7, pp. 102-111.
19. Zadeh, L. Advances in Fuzzy Set Theory and Applications; World Scientific Publishing: Amsterdam, The Netherlands, 1979; pp. 3-18.
20. Pedrycz, W.; Vukovich, G. Abstraction and specialization of information granules. IEEE Trans. Syst. Man Cybern. Part B; 2001; 31, pp. 106-111. [DOI: https://dx.doi.org/10.1109/3477.907568]
21. Guo, J.; Lu, W.; Yang, J.; Liu, X. A rule-based granular model development for interval-valued time series. Int. J. Approx. Reason.; 2021; 136, pp. 201-222. [DOI: https://dx.doi.org/10.1016/j.ijar.2021.06.009]
22. Ruan, J.; Wang, X.; Shi, Y. Developing fast predictors for large-scale time series using fuzzy granular support vector machines. Appl. Soft Comput.; 2013; 13, pp. 3981-4000. [DOI: https://dx.doi.org/10.1016/j.asoc.2012.09.005]
23. Zhou, Y.; Ren, H.; Li, Z.; Pedrycz, W. Anomaly detection based on a granular Markov model. Expert Syst. Appl.; 2022; 187, 115744. [DOI: https://dx.doi.org/10.1016/j.eswa.2021.115744]
24. He, L.; Chen, Y.; Zhong, C.; Wu, K. Granular Elastic Network Regression with Stochastic Gradient Descent. Mathematics; 2022; 10, 2628. [DOI: https://dx.doi.org/10.3390/math10152628]
25. Hu, M.; Wang, C.; Yang, J.; Wu, Y.; Fan, J.; Jing, B. Rain Rendering and Construction of Rain Vehicle Color-24 Dataset. Mathematics; 2022; 10, 3210. [DOI: https://dx.doi.org/10.3390/math10173210]
26. Yu, F.; Pedrycz, W. The design of fuzzy information granules: Tradeoffs between specificity and experimental evidence. Appl. Soft Comput.; 2009; 9, pp. 264-273. [DOI: https://dx.doi.org/10.1016/j.asoc.2007.10.026]
27. Pedrycz, W.; Wang, X. Designing fuzzy sets with the use of the parametric principle of justifiable granularity. IEEE Trans. Fuzzy Syst.; 2016; 24, pp. 489-496. [DOI: https://dx.doi.org/10.1109/TFUZZ.2015.2453393]
28. Dong, K. Time Series Information Granulation and Clustering Analysis Based on Granulation; Beijing Normal University: Beijing, China, 2005.
29. Yang, X.; Yu, F.; Pedrycz, W. Long-term forecasting of time series based on linear fuzzy information granules and fuzzy inference system. Int. J. Approx. Reason.; 2017; 81, pp. 1-27. [DOI: https://dx.doi.org/10.1016/j.ijar.2016.10.010]
30. Luo, C.; Song, X.; Zheng, Y. A novel forecasting model for the long-term fluctuation of time series based on polar fuzzy information granules. Inf. Sci.; 2020; 512, pp. 760-779. [DOI: https://dx.doi.org/10.1016/j.ins.2019.10.020]
31. Luo, C.; Wang, H. Fuzzy forecasting for long-term time series based on time-variant fuzzy information granules. Appl. Soft Comput.; 2020; 88, 106046. [DOI: https://dx.doi.org/10.1016/j.asoc.2019.106046]
32. Tang, Y.; Yu, F. Fuzzy information granulation: Review of theory and applications. J. Beijing Norm. Univ.; 2022; 58, pp. 349-361.
33. Zadeh, L.A. Fuzzy sets. Inf. Control; 1965; 8, pp. 338-353. [DOI: https://dx.doi.org/10.1016/S0019-9958(65)90241-X]
34. Shen, Y. Calculus for linearly correlated fuzzy number-valued functions. Fuzzy Sets Syst.; 2022; 429, pp. 101-135. [DOI: https://dx.doi.org/10.1016/j.fss.2021.02.017]
35. Rote, G. Computing the minimum Hausdorff distance between two point sets on a line under translation. Inf. Process. Lett.; 1991; 38, pp. 123-127. [DOI: https://dx.doi.org/10.1016/0020-0190(91)90233-8]
36. Dubois, D.; Prade, H. Fundamentals of Fuzzy Sets; Springer: New York, NY, USA, 2000; 90.
37. Yu, F.; Dong, K.; Chen, F.; Jiang, Y.; Zeng, W. Clustering Time Series with Granular Dynamic Time Warping Method. Proceedings of the Clustering Time Series with Granular Dynamic Time Warping Method; Fremont, CA, USA, 2–4 November 2007; 393.
38. Luo, C.; Yu, F.; Zeng, W. Introduction to Fuzzy Sets; Beijing Normal University Press: Beijing, China, 2019.
39. Fan, R.; Chang, K.; Hsieh, C.; Wang, X.R.; Lin, C.J. LIBLINEAR: A library for large linear classification. J. Mach. Learn. Res.; 2008; 9, pp. 1871-1874.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
Fuzzy information granulation transfers the time series analysis from the numerical platform to the granular platform, which enables us to study the time series at a different granularity. In previous studies, each fuzzy information granule in a granular time series can reflect the average, range, and linear trend characteristics of the data in the corresponding time window. In order to get a more general information granule, this paper proposes polynomial fuzzy information granules, each of which can reflect both the linear trend and the nonlinear trend of the data in a time window. The distance metric of the proposed information granules is given theoretically. After studying the distance measure of the polynomial fuzzy information granule and its geometric interpretation, we design a time series prediction method based on the polynomial fuzzy information granules and fuzzy inference system. The experimental results show that the proposed prediction method can achieve a good long-term prediction.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details

1 Key Laboratory of Intelligent Computing and Information Processing, Quanzhou Normal University, Quanzhou 362000, China; Fujian Key Laboratory of Financial Information Processing, Putian University, Putian 351100, China; School of Mathematical Science, Beijing Normal University, Beijing 100875, China; Fujian Provincial Key Laboratory of Data-Intensive Computing, Quanzhou Normal University, Quanzhou 362000, China; Fujian Big Data Research Institute of Intelligent Manufacturing, Quanzhou Normal University, Quanzhou 362000, China
2 Key Laboratory of Intelligent Computing and Information Processing, Quanzhou Normal University, Quanzhou 362000, China; Fujian Provincial Key Laboratory of Data-Intensive Computing, Quanzhou Normal University, Quanzhou 362000, China
3 Fujian Key Laboratory of Financial Information Processing, Putian University, Putian 351100, China
4 School of Mathematical Science, Beijing Normal University, Beijing 100875, China