Full Text

Turn on search term navigation

This work is licensed under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

1. Introduction

Freeway short-term traffic prediction models have been researched extensively in the literature [1–8]. The strong interest in these models is that they can be used to provide road operators with predictive intelligence tools to help them optimize freeway operations and avoid traffic breakdowns. These models have been developed from a variety of theoretical backgrounds including statistical techniques and artificial intelligence (AI) methods based on neural networks [9]. With the development of big data and complex computational intelligence, AI methods can predict future traffic more accurately than statistical models. In particular, deep learning networks can represent traffic dynamic behaviour and have recently achieved massive success in time series modelling. An example of recent models is the unidirectional long short-term memory (Uni-LSTM) recurrent neural network and its extension bidirectional long short-term memory (BiLSTM). Previous research has shown that the Uni-LSTM model has an effective prediction in handling long-term dependencies as it remembers useful information from inputs that has already passed through using “additional gates” incorporated in its architecture [10–12]. However, in more recent years, a bidirectional LSTM (BiLSTM) model has been investigated which offers an additional training capability as the output layer receives information from past (backwards) and future (forward) instances simultaneously and it provides better prediction accuracy [13–16]. In this paper, we assess the performance of Uni-LSTM and BiLSTM for different time horizons using speed and flow field data for multiple freeways in Australia. The main research questions are as follows:

(1) Will results be improved if speed and flow data are trained from both directions (forward direction and backward direction)?

(2) How adding layers or mixing both LSTM and BiLSTM as one model affects the model performance?

(3) If the model is trained and tested for one freeway, will it achieve a good accuracy if validated only (without retraining) on an independent dataset from a different freeway?

This paper aims to address these questions and demonstrate the feasibility of using advanced AI techniques based on deep learning Uni-LSTM and BiLSTM models to predict speed and flow for multiple prediction horizons. It provides a comparative performance analysis of both Uni-LSTM and BiLSTM based on a common dataset of field measurements. The models are developed using historical data extracted from sensors embedded in pavements on three freeways in Australia: the Pacific Motorway between Brisbane and the Gold Coast in Queensland, Tullamarine Freeway in Melbourne, and South Eastern Freeway in Melbourne. This paper also investigates whether additional layers of training improve prediction accuracies for both speed and flow. To our knowledge, there have been limited papers targeting the BiLSTM model for future traffic prediction, and this paper shows the robust performance of this extension of the Uni-LSTM model. Also, this paper validates the performance of a developed model on different freeways which makes this work a valuable contribution to knowledge in the intelligent transport systems and network operations fields. This provides road operators and transport agencies with confidence that they can apply these models on different freeways even if they have not embarked on comprehensive historical data collection efforts for the target freeways. This also helps them with reducing the cost of deployment of these algorithms by avoiding the need to preprocess new data and calibrate and validate new models which is a time-consuming undertaking that requires substantial resources and experienced and well-trained AI staff and specialists.

This paper is organized as follows: Section 2 provides a scan of previous research work. Section 3 presents the methodology including data collection and modelling frameworks. Section 4 presents the results of the comparative evaluation of different models. Section 5 presents the performance of stacked and mixed Uni-LSTM and BiLSTM models. Section 6 shows the model validation results, and finally, Section 7 presents the conclusions and future research directions.

2. Literature Review

The prediction and forecasting of short-term (1 to 60 minutes into the future) traffic conditions plays an important role in the success of intelligent transport systems (ITSs) such as travel information systems, adaptive traffic management systems, public transportation scheduling, and commercial vehicle operations [17–19]. Due to the wide body of literature on this topic, we focus the literature scan in this section on traffic prediction using LSTM models, which used field traffic data collected from inductive loop detectors, CCTV, probe vehicles, and incident reports. A comprehensive review of other models including those that used simulated data can be found in [17]. Increasingly, road operators have more confidence in models that have been developed using real-life data and hence our focus in this work on model development and evaluations using data from real-world environments.

Methodologies used for flow and speed prediction can be classified into two broad parametric and nonparametric approaches [20]. Examples of commonly used parametric methods include linear models such as autoregressive integrated moving average model (ARIMA) [21], seasonal ARIMA, i.e., SARIMA model [22], exponential smoothing model [23], and ARIMA with Kalman filter (KF) [24, 25]. These parametric methods perform poorly with dynamic traffic patterns, which limits their application in complex traffic prediction compared to nonparametric methods. Nonparametric methods are more capable of predicting a stochastic pattern of input traffic data and are better at handling noisy data.

With the recent advancements in machine learning, many models have shown a promising potential in solving nonlinear problems and handling long-term dependencies. Examples include LSTM and BiLSTM models. These models were previously used to forecast future traffic speeds [10], travel times [18], and traffic flows [11]. In one study, the long short-term memory (LSTM) structure was applied for future speed prediction and showed that it provides higher performance compared to classical methods [10]. Another study showed that using LSTM models is promising for irregular travel time prediction models as the error for 1-step-ahead prediction error is relatively small [18]. Other studies have shown that flow prediction using LSTM achieved high accuracy compared to other models for different prediction horizons [11]. Also, LSTM models have been developed in other studies on car-following models to predict acceleration and deceleration on different road hierarchies [26].

Short-term traffic flow using LSTM has also been investigated where the dependency relationships of time series data were fully considered, and experimental results showed a very good performance with an error of 5.4% when compared with other models [27]. In other studies, an end-to-end deep learning model has been investigated to predict future traffic flows [28] where one BiLSTM layer was added, and the results showed that the model was capable of solving stochastic flow characteristics and overcoming overfitting problems [28]. Similarly, stacking BiLSTM and Uni-LSTM models were developed in another study to predict network-wide traffic speeds. The results showed that the stacked architecture outperforms both BiLSTM and Uni-LSTM models [29].

In another study, different models were developed that showed superior performance when using deeper BiLSTM layers for urban traffic prediction [30]. Other researchers have also used LSTM and RNN approaches for speed prediction models under various urban driving conditions with credible and accurate results [31]. LSTM and gated recurrent units (GRUs) were also applied in recent studies to predict the general condition of driving speed in consideration of the road geometry and temporal evolution of traffic demand. The results showed superior LSTM model performance compared to regression models [32]. Similarly, superior model performance has been shown from using LSTM and GRU models when compared to ARIMA and support vector regression (SVR) models for the track flow prediction [33].

In other studies, a variational long short-term memory encoder was examined to predict traffic flow which provided better prediction in comparison to other conventional methods used [34]. Similarly, a long short-term memory-genetic algorithm support vector regression (LSTM-GASVR) short-term traffic flow prediction algorithm was reported to predict future traffic flows with better accuracies than LSTM, GRU, convolutional neural networks (CNNs), stacked autoencoder (SAE), ARIMA, and support vector regression (SVR) models tested in the same study [35].

Furthermore, LSTM models have also been developed for momentary traffic stream forecasts, which aim to help transport authorities in decision-making during rush hour for gridlock prediction since the model remembers information for longer periods of time than other models [36]. Also, the validity of LSTM models has been verified in studies on prediction of short-term traffic flow and were found to provide high prediction accuracies for flow data [37]. Other studies have documented superior performance when ARIMA and long short-term memory (LSTM) neural networks were combined for short-term traffic flow prediction [38]. In another recent study, type-2 fuzzy LSTM (T2F-LSTM) was developed for long-term traffic volume prediction and extraction of spatial-temporal characteristics of traffic volumes and was found to achieve high prediction accuracies [39].

In summary, a substantial number of studies in the literature have addressed short-term traffic prediction with robust LSTM models. However, only few studies have addressed the promising potential of BiLSTM models for traffic time series future prediction that consider the backward temporal dependencies. Another important contribution in this work is the comparison of stacked and mixed BiLSTM and LSTM layers for model accuracy improvement. In addition, this paper discusses the model applicability when being developed on parameters of one location and validated only (without retraining) on different locations. Furthermore, the models are developed using field data that comprised diverse and complex traffic characteristics (including peak, nonpeak, weekday, weekend, incident, and nonincident data). Another important factor in this work is that the data used for model development have been methodically screened, preprocessed, and validated in a large number of previous studies [40, 41].

3. Study Methodology

This section of the paper presents the study methodology including data collection, model development, evaluation tests, and analyses.

3.1. Data for Model Development

Neural network applications require large amounts of data that are needed for model development [41, 42]. The data are typically divided into a training dataset used for model calibration and a testing dataset used for model verification. The validity of the model is tested on an independent dataset not used in model training, referred to as the testing dataset. In this research, the data used for model development included traffic speed and flow measurements collected from sensors installed on a number of freeways in Australia. The data were collected over a number of years and time periods including peak, nonpeak, weekday, and weekend conditions. Another unique characteristic of the data is that they include incident traffic conditions which are usually difficult to capture for model development. Such incidents, which include road crashes, broken down vehicles, and similar nonrecurrent events, typically result in a significant capacity reduction of the freeway and last for long durations. Including these data in model training and validation improves the robustness of the prediction models.

3.1.1. Dataset 1: Pacific Motorway, Queensland

This dataset was collected from a section of the Pacific Motorway between Brisbane and the Gold Coast in Queensland [40]. The length of the section is around 1.5 km. Speed and flow data were gathered from 4 detection stations (S0–S3) which include inductive loop sensors installed at approximately 500 m interval as shown in Figure 1. These data were collected for a period of 5 hours (2 hours peak and 3 hours off-peak traffic conditions). The data comprised normal traffic conditions that did not include any incidents. A total of 1,667 observations were gathered. For this study, the data were divided into 1000 observations for training (60% of the total dataset) and 667 observations for testing (40% of the total dataset).

[figure omitted; refer to PDF]

The data from the two Melbourne freeways are also important in that each freeway carries more than 100,000 vehicles per day. The incident data collected from these freeways (100 in total) had varying characteristics that included a representative range of incidents on freeways. For example, four incidents resulted in blocking one lane of traffic, 77 resulted in blocking 2 lanes, and 19 resulted in blocking three lanes. Five of the incidents occurred during low-flow conditions (below 700 vphpl), 58 during heavy-flow conditions (above 1550 vphpl), and 37 during moderate flow conditions. Twenty-five incidents also occurred during peak-hour traffic conditions. As for the distribution of incident duration, 26 incidents lasted for less than 30 minutes; 32 lasted between 30 and 60 minutes; 30 lasted between 60 and 90 minutes; and 12 lasted more than 90 minutes [41].

Samples of speed and flow data for the three freeways are shown in Figures 4 and 5, respectively. These figures represent a small portion of the data for illustrative purposes. Figure 4 shows speed patterns in km/h during AM peak between 5:30 and 8:00 AM. The figure demonstrates that the South Eastern Freeway is the most congested freeway with a speed lower than 20 km/h followed by the Pacific Motorway and Tullamarine Freeway. As for Figure 5, the flow patterns are illustrated in veh/h for the period from 9 AM to 12 PM. The figure shows that each freeway behaves differently during the same period of time as the flow ranges between 7 and 56 vehicles per hour for the three freeways. In summary, the real-life datasets used in this study are considered to be one of the most diverse and representative field traffic datasets available particularly in the Australian context. They are also unique in that they have been meticulously screened, cleaned, preprocessed, and validated in a large number of studies [15].

[figure omitted; refer to PDF]

In these models, the following formulae are used to calculate the predicted values: $\begin{matrix} (1) & input gate I_{t} = σ_{g} W_{i} X_{t} + R_{i} h_{t - 1} + b_{i}, \\ forget gate f_{t} = σ_{g} W_{f} X_{t} + R_{f} h_{t - 1} + b_{f}, \\ cell candidate C_{t} = σ_{c} W_{c} X_{t} + R_{c} h_{t - 1} + b_{c}, \\ output gate o_{t} = σ_{g} W_{o} X_{t} + R_{o} h_{t - 1} + b_{o}, \end{matrix}$ where $σ_{g}$ is the gate activation function and $W_{i}, W_{f}, W_{c}, and W_{o}$ are input weight matrices.

$R_{i}, R_{f}, R_{c}, and R_{o}$ are recurrent weight matrices, $X_{t}$ is the input, and $h_{t - 1}$ is the output at the previous time (t − 1). $b_{i}, b_{f}, b_{c}, and b_{o}$ are bias vectors. The forget gate determines how much of the prior memory values should be removed from the cell state. Similarly, the input gate specifies new input to the cell state. Then, the cell state C_t and the output H_t of the LSTM at time t are calculated as follows: $\begin{matrix} (2) & C_{t} = f t ⊙ c t ⊙ 1 + i t ⊙ g t, \\ H_{t} = o t ⊙ σ c c t, \end{matrix}$ where ⊙ denotes the Hadamard product (element-wise multiplication of vectors).

In this work, the unidirectional and bidirectional LSTM networks were implemented in MATLAB R2020b. First, the data were arranged as two column values: the first column corresponds to speed/flow at time t and the second column corresponds to the expected output (t + n) where n ranges from 5 minutes to 60 minutes into the future. Then, the data were partitioned into training and testing sets. The models were trained on the first 60% of the sequence and tested on the last 40%. To prevent model overfitting, the training/testing data were standardized to have zero mean and unit variance. The LSTM networks were created using four layers: sequence input layer (number of features = 1), Uni-LSTM/BiLSTM layers (number of hidden units = 300), fully connected layer (number of responses = 1), and a regression layer. The model hyperparameter settings are presented in Table 1. Multiple sets of hyperparameters were tested with the aim to find the right combination of values which result in the best accuracy. Table 1 shows the parameters that provided the optimal results. The tanh and sigmoid functions were used for state and gate activation functions, respectively. The LSTM experiments were also implemented in MATLAB R2020b with the Deep Learning Toolbox functions of trainNetwork, training options, and predictAndUpdateState.

Table 1

Model hyperparameters for Uni-LSTM and BiLSTM.

Gradient decay factor	0.9
Initial learning rate	0.005
Minimum batch size	128
Maximum epochs	300
Training optimizer	Adaptive moment estimation optimizer
Dropping learning rate during training	Piecewise
Learning rate drop period	125
Factor for learning rate dropping	0.2

4. Comparative Evaluation of Uni-LSTM and BiLSTM

The first set of results in this paper was for speed and flow data for the Tullamarine Freeway in Melbourne (Table 2) and the Pacific Motorway in Brisbane (Table 3). The data from both freeways were divided into 60% training data and 40% testing data. The mean absolute percentage error (MAPE) is used to calculate the accuracy of the model prediction for different time horizons. MAPE calculates the average absolute difference between the predicted output from the model (Y1) and expected true output (Y): $\begin{matrix} (3) & MAPE % = \frac{1}{n} \sum_{i = 1}^{n} \frac{Y - Y 1}{Y} \times 100, \\ accuracy % = 100 - MAPE . \end{matrix}$

Table 2

Speed performance for different prediction horizons for the Tullamarine Freeway.

Table 3

Flow performance for different prediction horizons for the Tullamarine Freeway.

The speed prediction results showed that BiLSTM and Uni-LSTM achieve high prediction results up to 60 minutes into the future. BiLSTM outperforms Uni-LSTM with accuracies above 92.6% up to 60 minutes for the Tullamarine Freeway. For a prediction horizon up to 60 minutes, accuracy improvements over Uni-LSTM were 7% for 5 minutes, 6% for 10 minutes, 7% for 15 minutes, 13% for 30 minutes, and 15% and 16% for 45 and 60 minutes, respectively. For the Pacific Motorway, BiLSTM outperforms Uni-LSTM up to 15 minutes, and then Uni-LSTM presents better results up to 60 minutes; however, the two models produce similar results for the 60-minute prediction horizon (e.g., 93.6% versus 92.7% as shown in Table 2.

For the Pacific Motorway, the results showed that BiLSTM outperformed Uni-LSTM for up to 45 minutes with an accuracy improvement of 14% for 5 minutes; 14% for 10 minutes; 9% for 15 minutes; 8% for 30 minutes; and 2% for 45 minutes. For 60-minute prediction horizons, the percentage differences in accuracies between Uni-LSTM and BiLSTM were found to be minimal (0.01%) as reported in Table 3. In Tables 2 and 3, the cells highlighted in green denote best-performing models, and cells highlighted in yellow denote second best-performing models.

5. Deep and Mixed Unidirectional and Bidirectional LSTM

In this section, the results for multiple Uni-LSTM and BiLSTM layers to improve the results for both speed and flow are presented. Also, results for combining both LSTM and BiLSTM layers are presented for the 15-minute horizon for the Tullamarine Freeway and Pacific Motorway. To our knowledge, limited publications have tested deep architectures of BiLSTM and mixed models to measure the backward dependency of traffic speed and flow prediction.

The results provided in Tables 4 and 5 show that deep BiLSTM with combined layers outperforms Uni-LSTM and deep Uni-LSTM models for 15-minute prediction horizons on both freeways. For speed, 3-BiLSTM layers and 4-BiLSTM layers provide the best accuracy of 98% on the Tullamarine Freeway, while LSTM layers provide the lowest accuracy of around 94%. The 4-layered BiLSTM model outperformed other models with 92.5% accuracy for 15-minute prediction horizons on the Tullamarine Freeway. Similarly, Pacific Motorway experiments show that the 4-layer BiLSTM model outperformed other models with an accuracy of 99.99% as shown in Tables 4 and 5.

Table 4

Speed and flow performance for different prediction horizons for the Tullamarine Freeway.

Table 5

Speed and flow performance for different prediction horizons for the Pacific Motorway.

Figures 8 and 9 present the speed and flow results for the 15-minute prediction horizon using the best-performing 4-layered BiLSTM model. These figures compare the target or expected values (blue trendline) with the predicted values from the model (orange trendline). Figure 8 shows the superior performance of the model for predicting both speed and flow with accuracies of 98% and 92.50% for 15-minute prediction horizons on the Tullamarine Freeway. Similarly, Figure 9 shows a remarkable prediction performance for the 4-layered BiLSTM model on the Pacific Motorway with a prediction accuracy of 99.99% for both speed and flow for 15-minute prediction horizons.

[figures omitted; refer to PDF]

6. Deep BiLSTM Model Validation and Transferability

This section of the paper presents results for model validation and potential for transferability to other freeways without the need for recalibration and retraining. If this can be achieved even at the expense of a depreciated accuracy, it can provide road operators and transport agencies with confidence that they can apply existing models on different freeways even if they have not embarked on comprehensive historical data collection efforts. This also helps them with reducing the cost of deployment of these algorithms by avoiding the need to preprocess new data and calibrate and validate new models which is a time-consuming undertaking that requires substantial resources and experienced and well-trained AI staff and specialists.

The model validation experimental design is shown in Figure 10. The learning obtained from the previous comparative evaluations was used to develop robust speed and flow prediction models using data combined from the two largest datasets (Tullamarine and South Eastern Freeways in Melbourne). The data used in model development included 24,270 observations and were divided into two sets: training set comprising 60% of the data (14,562 observations) and testing set comprising 40% of the data (9,708 observations). The validation dataset included 1,667 observations from the third freeway (Pacific Motorway in Brisbane). The model development results are provided in the first set of columns in Table 6 (Tullamarine and South Eastern Freeways). For speed, the model accuracy ranged from 99.7% for 5-minute forecasting horizons to 91.8% for 60-minute forecasting horizons. For flow, the model accuracy ranged from 99.6% for 5-minute forecasting horizons to 71.2% for 60-minute forecasting horizons. The model was then validated (without retraining) on the third independent dataset from the Pacific Motorway in Brisbane.

[figure omitted; refer to PDF]

Table 6

Speed and flow validation results for different prediction horizons.

Prediction horizons	Tullamarine Freeway + South Eastern Freeway				Validation on the Pacific Motorway
	Speed (km/h)		Flow (veh/h)		Speed (km/h)		Flow (veh/h)
	MAPE (%)	Accuracy (%)	MAPE (%)	Accuracy (%)	MAPE (%)	Accuracy (%)	MAPE (%)	Accuracy (%)
5 mins	0.30	99.70	0.38	99.62	0.27	99.73	2.83	97.17
10 mins	2.61	97.39	4.15	95.85	3.02	96.98	9.26	90.74
15 mins	3.06	96.94	6.82	93.18	4.63	95.37	17.49	82.51
30 mins	8.14	91.86	11.26	88.74	6.28	93.72	17.91	82.09
45 mins	7.74	92.26	19.01	80.99	7.84	92.16	20.81	79.19
60 mins	8.17	91.83	28.77	71.23	9.83	90.17	26.55	73.45

The validation results are shown in the right side columns of Table 6. For speed, the model’s accuracy ranged from 99.7% for 5-minute forecasting horizons to 90.2% for 60-minute forecasting horizons. For traffic flow, the model’s accuracy ranged from 97.2% for 5-minute forecasting horizons to 82.1% for 30-minute forecasting horizons. The performance degrades to 79.19% for 45-minute and 73.45% for 60-minute prediction horizons, as shown in Table 6. These findings are also depicted in Figures 11–16.

[figures omitted; refer to PDF]

In Figures 11–16, the blue trendline represents the targeted real data compared to the orange trendline which represents the results generated from the model. In Figure 13, the difference between targeted and predicted results for both speed and flow is minimal as the MAPE percentages between the two are 0.27% and 2.83%, respectively. In Figure 14, the speed results for 10-minute prediction horizons also demonstrate a low MAPE percentage error of 3.02% compared to 9.26% for flow. As expected, it can be noted that the error increases as the prediction horizon increases. In Figure 15, the MAPE percentage error increases minimally for 15-minute prediction horizons for speed (4.63% increase) compared to a high increase in error for flow (17.49%). The same behaviour is observed for 30-minute prediction horizons as the errors for speed and flow increase to 6.28% and 17.91% as shown in Figure 16. These results suggest that the model is able to accurately validate speed to multiple prediction horizons. The flow prediction results also showed good accuracies higher than 80% for 5, 10, 15, and 30 minutes into the future using the 4-layered BiLSTM model.

7. Conclusions, Contributions, and Future Research Directions

In this paper, unidirectional and bidirectional LSTM networks were developed to predict speed and flow on freeways for forecasting horizons up to 60 minutes into the future. The models were evaluated based on historical field data collected from inductive loop sensors on a number of freeways in Australia. A comprehensive and rigorous procedure was adopted to evaluate the suitability of different architectures and modelling parameters. The results showed a superior performance for the bidirectional compared to unidirectional LSTM. The results also demonstrated the challenges of predicting traffic flow, compared to speed. This was a result of the noisy nature of flow measurements compared to speed observations. For the Tullamarine Freeway, the BiLSTM model was able to achieve speed predictions up to 60 minutes into the future with an accuracy above 90%. For the flow prediction, the accuracy was above 80% up to 45 min forecasting horizons, outperforming the Uni-LSTM model. For the Pacific Motorway, BiLSTM also outperformed Uni-LSTM with accuracies above 88% for speed and above 80% for flow up to 60-minute prediction horizons.

This study also extended the models and evaluated their performance when adding multiple Uni-LSTM and BiLSTM layers or mixing both LSTM and BiLSTM as one model for 15-minute prediction horizons. The experiments showed that the 4-layered BiLSTM outperformed other models for both speed and flow on Tullamarine and Pacific Motorway datasets. Another contribution of this work was to examine model validation and its potential for transferability. The evaluation was undertaken on a combined dataset from the Tullamarine and South Eastern Freeways in Melbourne. This approach enabled us to train the models on a large dataset with different patterns and variable traffic conditions, including peak, nonpeak, weekday, weekend, and incident data. Once optimized, the model was validated by testing only (without retraining) on an independent dataset from a third freeway. The validation results showed speed prediction accuracies ranging from 99.7% for 5-minute forecasting horizons to 90.2% for 60-minute forecasting horizons. The flow validation prediction accuracies were lower and ranged between 74% and 97%. While it is acknowledged that more comprehensive testing is required on much larger numbers of freeways, this contribution demonstrates the potential to develop transferable models provided sufficient data are available to represent more diverse traffic conditions from different cities around the world.

Future directions in this research include collection of more field data from other real-life freeways in different cities both in Australia and overseas such as those reported in [44]. The use of microsimulation to generate edge case data that are difficult to measure in the field is also recommended [45, 46]. As was shown in this paper, training the models on different patterns and variable traffic conditions enabled us to develop robust models that can perform well on an independent dataset. With more data used for training and model development, it is expected that the accuracy will improve. The AI field is also witnessing a fast pace of developments and breakthroughs providing future opportunities to test new architectures to further improve the model performance and accuracy.

Acknowledgments

The first author would like to acknowledge her Ph.D. scholarship provided by the Iraqi Government and Swinburne University of Technology in Melbourne, Australia.

References

[1] H. Dia, D. Harney, A. Boyle, "Dynamics of drivers’ route choice decisions under advanced traveller information systems," Road & Transport Research, vol. 10 no. 4, 2001.

[2] R. Abduljabbar, H. DIA, "Predictive intelligence: a neural network learning system for traffic condition prediction and monitoring on freeways," Journal of the Eastern Asia Society for Transportation Studies, vol. 13, pp. 1785-1800, DOI: 10.11175/easts.13.1785, 2019.

[3] R. Abduljabbar, H. Dia, "A deep learning approach for freeway vehicle speed and flow prediction," Proceedings of the Australasian Transport Research Forum (ATRF), 41st, 2019, .

[4] K. Thomas, H. Dia, N. Cottman, "January. Simulation of arterial incident detection using neural networks," Proceedings of the 8th World Congress on ITS, .

[5] S. Nigarnjanagool, H. Dia, "Evaluation of a dynamic signal optimisation control model using traffic simulation," IATSS Research, vol. 29 no. 1, pp. 22-30, DOI: 10.1016/s0386-1112(14)60115-1, 2005.

[6] K. Thomas, H. Dia, "Comparative evaluation of freeway incident detection models using field data," IEE Proceedings-Intelligent Transport Systems, vol. 153 no. 3, pp. 230-241, 2006.

[7] J. Tang, F. Liu, Y. Zou, W. Zhang, Y. Wang, "An improved fuzzy neural network for traffic speed prediction considering periodic characteristic," IEEE Transactions on Intelligent Transportation Systems, vol. 18 no. 9, pp. 2340-2350, DOI: 10.1109/tits.2016.2643005, 2017.

[8] J. Tang, X. Chen, Z. Hu, F. Zong, C. Han, L. Li, "Traffic flow prediction based on combination of support vector machine and data denoising schemes," Physica A: Statistical Mechanics and Its Applications, vol. 534,DOI: 10.1016/j.physa.2019.03.007, 2019.

[9] H. Dia, G. Rose, A. Snell, "Comparative performance of freeway automated incident detection algorithms," Institute of Transport and Logistics Studies Working Paper ITS-WP-96-15, 1996. https://ses.library.usyd.edu.au/handle/2123/19426

[10] X. Ma, Z. Tao, Y. Wang, H. Yu, Y. Wang, "Long short-term memory neural network for traffic speed prediction using remote microwave sensor data," Transportation Research Part C: Emerging Technologies, vol. 54, pp. 187-197, DOI: 10.1016/j.trc.2015.03.014, 2015.

[11] Z. Zhao, W. Chen, X. Wu, P. C. Y. Chen, J. Liu, "LSTM network: a deep learning approach for short‐term traffic forecast," IET Intelligent Transport Systems, vol. 11 no. 2, pp. 68-75, DOI: 10.1049/iet-its.2016.0208, 2017.

[12] D. Kang, Y. Lv, Y. Y. Chen, "Short-term traffic flow prediction with LSTM recurrent neural network," .

[13] H. Zou, Y. Wu, H. Zhang, Y. Zhan, "Short-term traffic flow prediction based on PCC-BiLSTM," pp. 489-493, .

[14] J. Wang, F. Hu, L. Li, "Deep bi-directional long short-term memory model for short-term traffic flow prediction," pp. 306-316, .

[15] T. Sun, C. Yang, K. Han, W. Ma, F. Zhang, "Bidirectional spatial-temporal network for traffic prediction with multisource data," Transportation Research Record: Journal of the Transportation Research Board, vol. 2674 no. 8, pp. 78-89, DOI: 10.1177/0361198120927393, 2020.

[16] S. Siami-Namini, N. Tavakoli, A. S. Namin, "The performance of LSTM and BiLSTM in forecasting time series," pp. 3285-3292, .

[17] R. Abduljabbar, H. Dia, S. Liyanage, S. A. Bagloee, "Applications of artificial intelligence in transport: an overview," Sustainability, vol. 11 no. 1,DOI: 10.3390/su11010189, 2019.

[18] Y. Duan, Y. Lv, F. Y. Wang, "Travel time prediction with LSTM neural network," pp. 1053-1058, .

[19] G. Song, M. Shuai, K. Xie, X. Ma, "An on-road wireless sensor network approach for urban traffic state monitoring," pp. 1195-1200, .

[20] R. Lund, "Time series analysis and its applications: with R examples," Journal of the American Statistical Association, vol. 102 no. 479,DOI: 10.1198/jasa.2007.s209, 2007.

[21] M. G. Karlaftis, E. I. Vlahogianni, "Memory properties and fractional integration in transportation time-series," Transportation Research Part C: Emerging Technologies, vol. 17 no. 4, pp. 444-453, 2009.

[22] G. Fusco, C. Colombaroni, N. Isaenko, "Short-term speed predictions exploiting big data on large urban road networks," Transportation Research Part C: Emerging Technologies, vol. 73, pp. 183-201, 2016.

[23] C. Chen, J. Hu, Q. Meng, Y. Zhang, "Short-time traffic flow prediction with ARIMA-GARCH model," Proceedings of the 2011 IEEE Intelligent Vehicles Symposium (IV), pp. 607-612, .

[24] J. Guo, W. Huang, B. M. Williams, "Adaptive Kalman filter approach for stochastic short-term traffic flow rate prediction and uncertainty quantification," Transportation Research Part C: Emerging Technologies, vol. 43, pp. 50-64, 2014.

[25] M. Lippi, M. Bertini, P. Frasconi, "Short-term traffic flow forecasting: an experimental comparison of time-series analysis and supervised learning," IEEE Transactions on Intelligent Transportation Systems, vol. 14 no. 2, pp. 871-882, 2013.

[26] J. Morton, T. A. Wheeler, M. J. Kochenderfer, "Analysis of recurrent neural networks for probabilistic modeling of driver behavior," IEEE Transactions on Intelligent Transportation Systems, vol. 18 no. 5, pp. 1289-1298, 2016.

[27] H. Shao, B. H. Soong, "Traffic flow prediction with long short-term memory networks (LSTMs)," pp. 2986-2989, .

[28] Q. Zhaowei, L. Haitao, L. Zhihui, Z. Tao, "Short-term traffic flow forecasting method with MB-LSTM hybrid network," IEEE Transactions on Intelligent Transportation Systems, 2020.

[29] Z. Cui, R. Ke, Z. Pu, Y. Wang, "Deep bidirectional and unidirectional LSTM recurrent neural network for network-wide traffic speed prediction," 2018. https://arxiv.org/abs/1801.02143

[30] M. Lu, J. Pang, J. Li, "DeepBSTN: a deep bidirection network model for urban traffic prediction," .

[31] K. Yeon, K. Min, J. Shin, M. Sunwoo, M. Han, "Ego-vehicle speed prediction using a long short-term memory based recurrent neural network," International Journal of Automotive Technology, vol. 20 no. 4, pp. 713-722, 2019.

[32] Y. Chen, Y. Chen, B. Yu, "Speed distribution prediction of freight vehicles on mountainous freeway using deep learning methods," Journal of Advanced Transportation, vol. 2020,DOI: 10.1155/2020/8953182, 2020.

[33] W. Wang, H. Zhang, T. Li, "An interpretable model for short term traffic flow prediction," Mathematics and Computers in Simulation, vol. 171, pp. 264-278, DOI: 10.1016/j.matcom.2019.12.013, 2020.

[34] M. Farahani, M. Farahani, M. Manthouri, O. Kaynak, "Short-term traffic flow prediction using variational LSTM networks," 2020. https://arxiv.org/abs/2002.07922

[35] J. Zhou, H. Chang, X. Cheng, X. Zhao, "A multiscale and high-precision LSTM-GASVR short-term traffic flow prediction model," Complexity, vol. 2020,DOI: 10.1155/2020/1434080, 2020.

[36] P. Poonia, V. K. Jain, "Short-term traffic flow prediction: using LSTM," .

[37] C. Kang, Z. Zhang, "Application of LSTM in short-term traffic flow prediction," Proceedings of the 2020 IEEE 5th International Conference on Intelligent Transportation Engineering (ICITE), pp. 98-101, .

[38] S. Lu, Q. Zhang, G. Chen, D. Seng, "A combined method for short-term traffic flow prediction based on recurrent neural network," Alexandria Engineering Journal, vol. 60 no. 1, pp. 87-84, DOI: 10.1016/j.aej.2020.06.008, 2020.

[39] R. Li, Y. Hu, Q. Liang, "T2F-LSTM method for long-term traffic volume prediction," IEEE Transactions on Fuzzy Systems, vol. 28 no. 12,DOI: 10.1109/TFUZZ.2020.2986995, 2020.

[40] H. Dia, "An object-oriented neural network approach to short-term traffic forecasting," European Journal of Operational Research, vol. 131 no. 2, pp. 253-261, DOI: 10.1016/s0377-2217(00)00125-9, 2001.

[41] H. Dia, G. Rose, "Development and evaluation of neural network freeway incident detection models using field data," Transportation Research Part C: Emerging Technologies, vol. 5 no. 5, pp. 313-331, DOI: 10.1016/s0968-090x(97)00016-8, 1997.

[42] D. T. Larose, C. D. Larose, Discovering Knowledge in Data: an Introduction to Data Mining, vol. 4, 2014.

[43] K. Yeon, K. Min, J. Shin, M. Sunwoo, M. Han, "Ego-vehicle speed prediction using a long short-term memory based recurrent neural network," International Journal of Automotive Technology, vol. 20 no. 4, pp. 713-722, DOI: 10.1007/s12239-019-0067-y, 2019.

[44] C. Sutandi, H. Dia, "Performance evaluation of an advanced traffic control system in a developing country," Proceedings of the 6th EASTS Conference, vol. 5, pp. 1572-1584, .

[45] S. Panwai, H. Dia, "Development and evaluation of a reactive agent-based car following model," Proceedings of the Intelligent Vehicles and Road Infrastructure Conference (IVRI ’05), .

[46] K. Thomas, H. Dia, N. Cottman, "Simulation of arterial incident detection using neural networks," Proceedings of the 8th World Congress on Intelligent Transport Systems, .

Word count: 5568

Show less

Copyright © 2021 Rusul L. Abduljabbar et al. This work is licensed under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

This paper presents the development and evaluation of short-term traffic prediction models using unidirectional and bidirectional deep learning long short-term memory (LSTM) neural networks. The unidirectional LSTM (Uni-LSTM) model provides high performance through its ability to recognize longer sequences of traffic time series data. In this work, Uni-LSTM is extended to bidirectional LSTM (BiLSTM) networks which train the input data twice through forward and backward directions. The paper presents a comparative evaluation of the two models for short-term speed and traffic flow prediction using a common dataset of field observations collected from multiple freeways in Australia. The results showed BiLSTM performed better for variable prediction horizons for both speed and flow. Stacked and mixed Uni-LSTM and BiLSTM models were also investigated for 15-minute prediction horizons resulting in improved accuracy when using 4-layer BiLSTM networks. The optimized 4-layer BiLSTM model was then calibrated and validated for multiple prediction horizons using data from three different freeways. The validation results showed a high degree of prediction accuracy exceeding 90% for speeds up to 60-minute prediction horizons. For flow, the model achieved accuracies above 90% for 5- and 10-minute prediction horizons and more than 80% accuracy for 15- and 30-minute prediction horizons. These findings extend the set of AI models available for road operators and provide them with confidence in applying robust models that have been tested and evaluated on different freeways in Australia.

Details

Title

Unidirectional and Bidirectional LSTM Models for Short-Term Traffic Prediction

Author

Abduljabbar, Rusul L¹

; Hussein Dia¹; Pei-Wei, Tsai²

¹ Department of Civil and Construction Engineering, Swinburne University of Technology, Melbourne, Australia
² Department of Computer Science and Software Engineering, Swinburne University of Technology, Melbourne, Australia

Editor

Jinjun Tang

Publication year

2021

Publication date

2021

Publisher

John Wiley & Sons, Inc.

ISSN

01976729

e-ISSN

20423195

Source type

Scholarly Journal

Language of publication

English

DOI

https://doi.org/10.1155/2021/5589075

ProQuest document ID

2508265923

Unidirectional and Bidirectional LSTM Models for Short-Term Traffic Prediction

Jump to:

Full Text

Abstract

Details

Suggested sources